1,527 results on '"Gupta, Abhishek"'
Search Results
2. Prevalence of Gastrointestinal Helminthic Infections in Black Bucks (Antilope cervicapra) of Tal Chhapar Sanctuary of Rajasthan
- Author
-
Chouhan, A.K., Pilania, P.K., Monika, Rathore, Bhavana, Gupta, Abhishek, Sodha, D.B., and Pateer, D.P.
- Published
- 2022
- Full Text
- View/download PDF
3. Physics-Informed Neuro-Evolution (PINE): A Survey and Prospects
- Author
-
Wong, Jian Cheng, Gupta, Abhishek, Ooi, Chin Chun, Chiu, Pao-Hsiung, Liu, Jiao, and Ong, Yew-Soon
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Computational Engineering, Finance, and Science ,Computer Science - Machine Learning - Abstract
Deep learning models trained on finite data lack a complete understanding of the physical world. On the other hand, physics-informed neural networks (PINNs) are infused with such knowledge through the incorporation of mathematically expressible laws of nature into their training loss function. By complying with physical laws, PINNs provide advantages over purely data-driven models in limited-data regimes. This feature has propelled them to the forefront of scientific machine learning, a domain characterized by scarce and costly data. However, the vision of accurate physics-informed learning comes with significant challenges. This review examines PINNs for the first time in terms of model optimization and generalization, shedding light on the need for new algorithmic advances to overcome issues pertaining to the training speed, precision, and generalizability of today's PINN models. Of particular interest are the gradient-free methods of neuroevolution for optimizing the uniquely complex loss landscapes arising in PINN training. Methods synergizing gradient descent and neuroevolution for discovering bespoke neural architectures and balancing multiple conflicting terms in physics-informed learning objectives are positioned as important avenues for future research. Yet another exciting track is to cast neuroevolution as a meta-learner of generalizable PINN models., Comment: 20 pages, 8 figures, 1 table
- Published
- 2025
4. Tiered Acquisition for Constrained Bayesian Optimization: An Application to Analog Circuits
- Author
-
Rashid, Ria and Gupta, Abhishek
- Subjects
Computer Science - Neural and Evolutionary Computing - Abstract
Analog circuit design can be considered as an optimization problem with the targeted circuit specifications as constraints. When stringent circuit specifications are considered, it is desired to have an optimization methodology that adapts well to heavily constrained search spaces. To this end, we propose a novel Bayesian optimization algorithm with a tiered ensemble of acquisition functions and demonstrate its considerable application potential for analog circuit design automation. Our method is the first to introduce the concept of multiple dominance among acquisition functions, allowing the search for the optimal solutions to be effectively bounded \emph{within} the predicted set of feasible solutions in a constrained search space. This has resulted in a significant reduction in constraint violations by the candidate solutions, leading to better-optimized designs within tight computational budgets. The methodology is validated in gain and area optimization of a two-stage Miller compensated operational amplifier in a 65 nm technology. In comparison to robust baselines and state-of-the-art algorithms, this method reduces constraint violations by up to 38% and improves the target objective by up to 43%. The source code of our algorithm is made available at https://github.com/riarashid/TRACE.
- Published
- 2024
5. STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning
- Author
-
Memmel, Marius, Berg, Jacob, Chen, Bingqing, Gupta, Abhishek, and Francis, Jonathan
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Systems and Control - Abstract
Robot learning is witnessing a significant increase in the size, diversity, and complexity of pre-collected datasets, mirroring trends in domains such as natural language processing and computer vision. Many robot learning methods treat such datasets as multi-task expert data and learn a multi-task, generalist policy by training broadly across them. Notably, while these generalist policies can improve the average performance across many tasks, the performance of generalist policies on any one task is often suboptimal due to negative transfer between partitions of the data, compared to task-specific specialist policies. In this work, we argue for the paradigm of training policies during deployment given the scenarios they encounter: rather than deploying pre-trained policies to unseen problems in a zero-shot manner, we non-parametrically retrieve and train models directly on relevant data at test time. Furthermore, we show that many robotics tasks share considerable amounts of low-level behaviors and that retrieval at the "sub"-trajectory granularity enables significantly improved data utilization, generalization, and robustness in adapting policies to novel problems. In contrast, existing full-trajectory retrieval methods tend to underutilize the data and miss out on shared cross-task content. This work proposes STRAP, a technique for leveraging pre-trained vision foundation models and dynamic time warping to retrieve sub-sequences of trajectories from large training corpora in a robust fashion. STRAP outperforms both prior retrieval algorithms and multi-task learning methods in simulated and real experiments, showing the ability to scale to much larger offline datasets in the real world as well as the ability to learn robust control policies with just a handful of real-world demonstrations., Comment: Project website at https://weirdlabuw.github.io/strap/
- Published
- 2024
6. Robot Learning with Super-Linear Scaling
- Author
-
Torne, Marcel, Jain, Arhan, Yuan, Jiayi, Macha, Vidaaranya, Ankile, Lars, Simeonov, Anthony, Agrawal, Pulkit, and Gupta, Abhishek
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Scaling robot learning requires data collection pipelines that scale favorably with human effort. In this work, we propose Crowdsourcing and Amortizing Human Effort for Real-to-Sim-to-Real(CASHER), a pipeline for scaling up data collection and learning in simulation where the performance scales superlinearly with human effort. The key idea is to crowdsource digital twins of real-world scenes using 3D reconstruction and collect large-scale data in simulation, rather than the real-world. Data collection in simulation is initially driven by RL, bootstrapped with human demonstrations. As the training of a generalist policy progresses across environments, its generalization capabilities can be used to replace human effort with model generated demonstrations. This results in a pipeline where behavioral data is collected in simulation with continually reducing human effort. We show that CASHER demonstrates zero-shot and few-shot scaling laws on three real-world tasks across diverse scenarios. We show that CASHER enables fine-tuning of pre-trained policies to a target scenario using a video scan without any additional human effort. See our project website: https://casher-robot-learning.github.io/CASHER/
- Published
- 2024
7. AMPS: ASR with Multimodal Paraphrase Supervision
- Author
-
Parulekar, Amruta, Gupta, Abhishek, Chattopadhyay, Sameep, and Jyothi, Preethi
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Spontaneous or conversational multilingual speech presents many challenges for state-of-the-art automatic speech recognition (ASR) systems. In this work, we present a new technique AMPS that augments a multilingual multimodal ASR system with paraphrase-based supervision for improved conversational ASR in multiple languages, including Hindi, Marathi, Malayalam, Kannada, and Nyanja. We use paraphrases of the reference transcriptions as additional supervision while training the multimodal ASR model and selectively invoke this paraphrase objective for utterances with poor ASR performance. Using AMPS with a state-of-the-art multimodal model SeamlessM4T, we obtain significant relative reductions in word error rates (WERs) of up to 5%. We present detailed analyses of our system using both objective and human evaluation metrics.
- Published
- 2024
8. Deep Learning for THz Channel Estimation and Beamforming Prediction via Sub-6GHz Channel
- Author
-
Bhattacharya, Sagnik and Gupta, Abhishek K.
- Subjects
Electrical Engineering and Systems Science - Signal Processing ,Computer Science - Artificial Intelligence - Abstract
An efficient channel estimation is of vital importance to help THz communication systems achieve their full potential. Conventional uplink channel estimation methods, such as least square estimation, are practically inefficient for THz systems because of their large computation overhead. In this paper, we propose an efficient convolutional neural network (CNN) based THz channel estimator that estimates the THz channel factors using uplink sub-6GHz channel. Further, we use the estimated THz channel factors to predict the optimal beamformer from a pre-given codebook, using a dense neural network. We not only get rid of the overhead associated with the conventional methods, but also achieve near-optimal spectral efficiency rates using the proposed beamformer predictor. The proposed method also outperforms deep learning based beamformer predictors accepting THz channel matrices as input, thus proving the validity and efficiency of our sub-6GHz based approach., Comment: Published: 2022 IEEE International Conference on Signal Processing and Communications (SPCOM 2022)
- Published
- 2024
- Full Text
- View/download PDF
9. Learning to Cooperate with Humans using Generative Agents
- Author
-
Liang, Yancheng, Chen, Daphne, Gupta, Abhishek, Du, Simon S., and Jaques, Natasha
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Multiagent Systems - Abstract
Training agents that can coordinate zero-shot with humans is a key mission in multi-agent reinforcement learning (MARL). Current algorithms focus on training simulated human partner policies which are then used to train a Cooperator agent. The simulated human is produced either through behavior cloning over a dataset of human cooperation behavior, or by using MARL to create a population of simulated agents. However, these approaches often struggle to produce a Cooperator that can coordinate well with real humans, since the simulated humans fail to cover the diverse strategies and styles employed by people in the real world. We show \emph{learning a generative model of human partners} can effectively address this issue. Our model learns a latent variable representation of the human that can be regarded as encoding the human's unique strategy, intention, experience, or style. This generative model can be flexibly trained from any (human or neural policy) agent interaction data. By sampling from the latent space, we can use the generative model to produce different partners to train Cooperator agents. We evaluate our method -- \textbf{G}enerative \textbf{A}gent \textbf{M}odeling for \textbf{M}ulti-agent \textbf{A}daptation (GAMMA) -- on Overcooked, a challenging cooperative cooking game that has become a standard benchmark for zero-shot coordination. We conduct an evaluation with real human teammates, and the results show that GAMMA consistently improves performance, whether the generative model is trained on simulated populations or human datasets. Further, we propose a method for posterior sampling from the generative model that is biased towards the human data, enabling us to efficiently improve performance with only a small amount of expensive human interaction data.
- Published
- 2024
10. Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL
- Author
-
Wagenmaker, Andrew, Huang, Kevin, Ke, Liyiming, Boots, Byron, Jamieson, Kevin, and Gupta, Abhishek
- Subjects
Computer Science - Machine Learning ,Computer Science - Robotics ,Statistics - Machine Learning - Abstract
In order to mitigate the sample complexity of real-world reinforcement learning, common practice is to first train a policy in a simulator where samples are cheap, and then deploy this policy in the real world, with the hope that it generalizes effectively. Such \emph{direct sim2real} transfer is not guaranteed to succeed, however, and in cases where it fails, it is unclear how to best utilize the simulator. In this work, we show that in many regimes, while direct sim2real transfer may fail, we can utilize the simulator to learn a set of \emph{exploratory} policies which enable efficient exploration in the real world. In particular, in the setting of low-rank MDPs, we show that coupling these exploratory policies with simple, practical approaches -- least-squares regression oracles and naive randomized exploration -- yields a polynomial sample complexity in the real world, an exponential improvement over direct sim2real transfer, or learning without access to a simulator. To the best of our knowledge, this is the first evidence that simulation transfer yields a provable gain in reinforcement learning in settings where direct sim2real transfer fails. We validate our theoretical results on several realistic robotic simulators and a real-world robotic sim2real task, demonstrating that transferring exploratory policies can yield substantial gains in practice as well., Comment: NeurIPS 2024
- Published
- 2024
11. Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR
- Author
-
Gupta, Abhishek, Parulekar, Amruta, Chattopadhyay, Sameep, and Jyothi, Preethi
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Automatic speech recognition (ASR) for low-resource languages remains a challenge due to the scarcity of labeled training data. Parameter-efficient fine-tuning and text-only adaptation are two popular methods that have been used to address such low-resource settings. In this work, we investigate how these techniques can be effectively combined using a multilingual multimodal model like SeamlessM4T. Multimodal models are able to leverage unlabeled text via text-only adaptation with further parameter-efficient ASR fine-tuning, thus boosting ASR performance. We also show cross-lingual transfer from a high-resource language, achieving up to a relative 17% WER reduction over a baseline in a zero-shot setting without any labeled speech.
- Published
- 2024
12. Error-Minimizing Measurements in Postselected One-Shot Symmetric Quantum State Discrimination and Acceptance as a Performance Metric
- Author
-
Gupta, Saurabh Kumar and Gupta, Abhishek K.
- Subjects
Quantum Physics ,Computer Science - Information Theory - Abstract
In hypothesis testing with quantum states, given a black box containing one of the two possible states, measurement is performed to detect in favor of one of the hypotheses. In postselected hypothesis testing, a third outcome is added, corresponding to not selecting any of the hypotheses. In postselected scenario, minimum error one-shot symmetric hypothesis testing is characterized in literature conditioned on the fact that one of the selected outcomes occur. We proceed further in this direction to give the set of all possible measurements that lead to the minimum error. We have given an arbitrary error-minimizing measurement in a parametric form. Note that not selecting any of the hypotheses decimates the quality of testing. We further give an example to show that these measurements vary in quality. There is a need to discuss the quality of postselected hypothesis testing. We then characterize the quality of postselected hypothesis testing by defining a new metric acceptance and give expression of acceptance for an arbitrary error-minimizing measurement in terms of some parameters of the measurement. On the set of measurements that achieve minimum error, we have maximized the acceptance, and given an example which achieves that, thus giving an example of the best possible measurement in terms of acceptance.
- Published
- 2024
13. Semantically Controllable Augmentations for Generalizable Robot Learning
- Author
-
Chen, Zoey, Mandi, Zhao, Bharadhwaj, Homanga, Sharma, Mohit, Song, Shuran, Gupta, Abhishek, and Kumar, Vikash
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Machine Learning - Abstract
Generalization to unseen real-world scenarios for robot manipulation requires exposure to diverse datasets during training. However, collecting large real-world datasets is intractable due to high operational costs. For robot learning to generalize despite these challenges, it is essential to leverage sources of data or priors beyond the robot's direct experience. In this work, we posit that image-text generative models, which are pre-trained on large corpora of web-scraped data, can serve as such a data source. These generative models encompass a broad range of real-world scenarios beyond a robot's direct experience and can synthesize novel synthetic experiences that expose robotic agents to additional world priors aiding real-world generalization at no extra cost. In particular, our approach leverages pre-trained generative models as an effective tool for data augmentation. We propose a generative augmentation framework for semantically controllable augmentations and rapidly multiplying robot datasets while inducing rich variations that enable real-world generalization. Based on diverse augmentations of robot data, we show how scalable robot manipulation policies can be trained and deployed both in simulation and in unseen real-world environments such as kitchens and table-tops. By demonstrating the effectiveness of image-text generative models in diverse real-world robotic applications, our generative augmentation framework provides a scalable and efficient path for boosting generalization in robot learning at no extra human cost., Comment: Accepted for publication by IJRR. First 3 authors contributed equally. Last 3 authors advised equally
- Published
- 2024
14. Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
- Author
-
Poddar, Sriyash, Wan, Yanming, Ivison, Hamish, Gupta, Abhishek, and Jaques, Natasha
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language ,Computer Science - Robotics - Abstract
Reinforcement Learning from Human Feedback (RLHF) is a powerful paradigm for aligning foundation models to human values and preferences. However, current RLHF techniques cannot account for the naturally occurring differences in individual human preferences across a diverse population. When these differences arise, traditional RLHF frameworks simply average over them, leading to inaccurate rewards and poor performance for individual subgroups. To address the need for pluralistic alignment, we develop a class of multimodal RLHF methods. Our proposed techniques are based on a latent variable formulation - inferring a novel user-specific latent and learning reward models and policies conditioned on this latent without additional user-specific data. While conceptually simple, we show that in practice, this reward modeling requires careful algorithmic considerations around model architecture and reward scaling. To empirically validate our proposed technique, we first show that it can provide a way to combat underspecification in simulated control problems, inferring and optimizing user-specific reward functions. Next, we conduct experiments on pluralistic language datasets representing diverse user preferences and demonstrate improved reward function accuracy. We additionally show the benefits of this probabilistic framework in terms of measuring uncertainty, and actively learning user preferences. This work enables learning from diverse populations of users with divergent preferences, an important challenge that naturally occurs in problems from robot learning to foundation model alignment., Comment: weirdlabuw.github.io/vpl
- Published
- 2024
15. Improving Self-supervised Pre-training using Accent-Specific Codebooks
- Author
-
Prabhu, Darshan, Gupta, Abhishek, Nitsure, Omkar, Jyothi, Preethi, and Ganapathy, Sriram
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning ,Computer Science - Sound ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Speech accents present a serious challenge to the performance of state-of-the-art end-to-end Automatic Speech Recognition (ASR) systems. Even with self-supervised learning and pre-training of ASR models, accent invariance is seldom achieved. In this work, we propose an accent-aware adaptation technique for self-supervised learning that introduces a trainable set of accent-specific codebooks to the self-supervised architecture. These learnable codebooks enable the model to capture accent specific information during pre-training, that is further refined during ASR finetuning. On the Mozilla Common Voice dataset, our proposed approach outperforms all other accent-adaptation approaches on both seen and unseen English accents, with up to 9% relative reduction in word error rate (WER)., Comment: Accepted to INTERSPEECH 2024
- Published
- 2024
16. Embracing Nonlinearity and Geometry: A dimensional analysis guided design of shock absorbing materials
- Author
-
Gupta, Abhishek, Chawla, Komal, and Thevamaran, Ramathasan
- Subjects
Condensed Matter - Materials Science ,Physics - Applied Physics - Abstract
Protective applications require energy-absorbing materials that are soft and compressible enough to absorb kinetic energy from impacts, yet stiff enough to bear crushing loads. Achieving this balance requires careful consideration of both mechanical properties and geometric design. Conventional shock-absorbing pads are made of very thick foams that exhibit a plateau of constant stress in their stress-strain response. Contrary to this belief, we report that foams with a nonlinear stress-strain response can be useful to achieve simultaneously thin and lightweight protective pads. We introduce a new framework for the thickness or volume-constrained design of compact and lightweight protective foams while ensuring the desired structural integrity and mechanical performance. Our streamlined dimensional analysis approach provides geometric constraints on the dimensionless thickness and cross-sectional area of a protective foam with a given stress-strain response to limit the acceleration and compressive strain within desired critical limits. We also identify optimal mechanical properties that will result in the most compact and lightest protective foam layer for absorbing a given kinetic energy of impact. Guided by this design framework, we achieve optimal protective properties in hierarchically architected vertically aligned carbon nanotube (VACNT) foams, enabling next generation protective applications in extreme environments.
- Published
- 2024
17. Data Efficient Behavior Cloning for Fine Manipulation via Continuity-based Corrective Labels
- Author
-
Deshpande, Abhay, Ke, Liyiming, Pfeifer, Quinn, Gupta, Abhishek, and Srinivasa, Siddhartha S.
- Subjects
Computer Science - Robotics - Abstract
We consider imitation learning with access only to expert demonstrations, whose real-world application is often limited by covariate shift due to compounding errors during execution. We investigate the effectiveness of the Continuity-based Corrective Labels for Imitation Learning (CCIL) framework in mitigating this issue for real-world fine manipulation tasks. CCIL generates corrective labels by learning a locally continuous dynamics model from demonstrations to guide the agent back toward expert states. Through extensive experiments on peg insertion and fine grasping, we provide the first empirical validation that CCIL can significantly improve imitation learning performance despite discontinuities present in contact-rich manipulation. We find that: (1) real-world manipulation exhibits sufficient local smoothness to apply CCIL, (2) generated corrective labels are most beneficial in low-data regimes, and (3) label filtering based on estimated dynamics model error enables performance gains. To effectively apply CCIL to robotic domains, we offer a practical instantiation of the framework and insights into design choices and hyperparameter selection. Our work demonstrates CCIL's practicality for alleviating compounding errors in imitation learning on physical robots., Comment: Presented at IROS 2024
- Published
- 2024
18. URDFormer: A Pipeline for Constructing Articulated Simulation Environments from Real-World Images
- Author
-
Chen, Zoey, Walsman, Aaron, Memmel, Marius, Mo, Kaichun, Fang, Alex, Vemuri, Karthikeya, Wu, Alan, Fox, Dieter, and Gupta, Abhishek
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
Constructing simulation scenes that are both visually and physically realistic is a problem of practical interest in domains ranging from robotics to computer vision. This problem has become even more relevant as researchers wielding large data-hungry learning methods seek new sources of training data for physical decision-making systems. However, building simulation models is often still done by hand. A graphic designer and a simulation engineer work with predefined assets to construct rich scenes with realistic dynamic and kinematic properties. While this may scale to small numbers of scenes, to achieve the generalization properties that are required for data-driven robotic control, we require a pipeline that is able to synthesize large numbers of realistic scenes, complete with 'natural' kinematic and dynamic structures. To attack this problem, we develop models for inferring structure and generating simulation scenes from natural images, allowing for scalable scene generation from web-scale datasets. To train these image-to-simulation models, we show how controllable text-to-image generative models can be used in generating paired training data that allows for modeling of the inverse problem, mapping from realistic images back to complete scene models. We show how this paradigm allows us to build large datasets of scenes in simulation with semantic and physical realism. We present an integrated end-to-end pipeline that generates simulation scenes complete with articulated kinematic and dynamic structures from real-world images and use these for training robotic control policies. We then robustly deploy in the real world for tasks like articulated object manipulation. In doing so, our work provides both a pipeline for large-scale generation of simulation environments and an integrated system for training robust robotic control policies in the resulting environments., Comment: Accepted at RSS2024
- Published
- 2024
19. Parameter Identification for Electrochemical Models of Lithium-Ion Batteries Using Bayesian Optimization
- Author
-
Pi, Jianzong, da Silva, Samuel Filgueira, Ozkan, Mehmet Fatih, Gupta, Abhishek, and Canova, Marcello
- Subjects
Electrical Engineering and Systems Science - Systems and Control ,Computer Science - Machine Learning - Abstract
Efficient parameter identification of electrochemical models is crucial for accurate monitoring and control of lithium-ion cells. This process becomes challenging when applied to complex models that rely on a considerable number of interdependent parameters that affect the output response. Gradient-based and metaheuristic optimization techniques, although previously employed for this task, are limited by their lack of robustness, high computational costs, and susceptibility to local minima. In this study, Bayesian Optimization is used for tuning the dynamic parameters of an electrochemical equivalent circuit battery model (E-ECM) for a nickel-manganese-cobalt (NMC)-graphite cell. The performance of the Bayesian Optimization is compared with baseline methods based on gradient-based and metaheuristic approaches. The robustness of the parameter optimization method is tested by performing verification using an experimental drive cycle. The results indicate that Bayesian Optimization outperforms Gradient Descent and PSO optimization techniques, achieving reductions on average testing loss by 28.8% and 5.8%, respectively. Moreover, Bayesian optimization significantly reduces the variance in testing loss by 95.8% and 72.7%, respectively., Comment: 6 pages
- Published
- 2024
20. DFT analysis of structural and optoelectronic properties of (PEO)8-LiFSI complexes
- Author
-
Agrahari, Shweta, Singh, Satya Pal, and Gupta, Abhishek Kumar
- Published
- 2025
- Full Text
- View/download PDF
21. Comparing Melting Curves of Metals Using the Equation of State and Lindemann's Law
- Author
-
Srivastava, Abhay P., Pandey, Brijesh K., Gupta, Abhishek K., and Shukla, Sachchidanand
- Published
- 2024
- Full Text
- View/download PDF
22. Predictive simulation framework for replicating energy reduction trends and joint kinematic adaptations in walking with powered ankle exoskeletons
- Author
-
Ganesan, Karthick and Gupta, Abhishek
- Published
- 2024
- Full Text
- View/download PDF
23. A New Approach to Evaluate Pressure of Solids at High Compression
- Author
-
Srivastava, Abhay P., Pandey, Brijesh K., Gupta, Abhishek Kumar, and Pandey, Anjani Kumar
- Published
- 2024
- Full Text
- View/download PDF
24. Synthesis Of Optical Active Light Harvesting Graphene-COFs Hybrid Materials For Artificial Photosynthesis: Efficient Solar-To-Chemical Energy Conversion
- Author
-
Shukla, Ravindra K., Yadav, Rajesh Kumar, Gole, V. L., Gupta, Abhishek Kumar, Shahin, Rehana, Mishra, Shaifali, Sharma, Kanchan, Dwivedi, D. K., and Baeg, Jin OoK
- Published
- 2024
- Full Text
- View/download PDF
25. Bayesian neural networks modeling for tool wear prediction in milling Al 6061 T6 under MQL conditions
- Author
-
Airao, Jay, Gupta, Abhishek, Nirala, Chandrakant K., and Hsue, Albert Wen-Jeng
- Published
- 2024
- Full Text
- View/download PDF
26. Health Data Sciences and Cardiovascular Diseases in South Asia: Innovations and Challenges in Digital Health
- Author
-
Gupta, Kartik, Junaid, Vashma, Qureshi, Muhammad Ahmad, Gupta, Abhishek, Sheikh, Sana, Dalakoti, Mayank, Virani, Salim S., and Khoja, Adeel
- Published
- 2024
- Full Text
- View/download PDF
27. Rank2Reward: Learning Shaped Reward Functions from Passive Video
- Author
-
Yang, Daniel, Tjia, Davin, Berg, Jacob, Damen, Dima, Agrawal, Pulkit, and Gupta, Abhishek
- Subjects
Computer Science - Robotics - Abstract
Teaching robots novel skills with demonstrations via human-in-the-loop data collection techniques like kinesthetic teaching or teleoperation puts a heavy burden on human supervisors. In contrast to this paradigm, it is often significantly easier to provide raw, action-free visual data of tasks being performed. Moreover, this data can even be mined from video datasets or the web. Ideally, this data can serve to guide robot learning for new tasks in novel environments, informing both "what" to do and "how" to do it. A powerful way to encode both the "what" and the "how" is to infer a well-shaped reward function for reinforcement learning. The challenge is determining how to ground visual demonstration inputs into a well-shaped and informative reward function. We propose a technique Rank2Reward for learning behaviors from videos of tasks being performed without access to any low-level states and actions. We do so by leveraging the videos to learn a reward function that measures incremental "progress" through a task by learning how to temporally rank the video frames in a demonstration. By inferring an appropriate ranking, the reward function is able to guide reinforcement learning by indicating when task progress is being made. This ranking function can be integrated into an adversarial imitation learning scheme resulting in an algorithm that can learn behaviors without exploiting the learned reward function. We demonstrate the effectiveness of Rank2Reward at learning behaviors from raw video on a number of tabletop manipulation tasks in both simulations and on a real-world robotic arm. We also demonstrate how Rank2Reward can be easily extended to be applicable to web-scale video datasets., Comment: ICRA 2024
- Published
- 2024
28. Bridging the Gap Between Theory and Practice: Benchmarking Transfer Evolutionary Optimization
- Author
-
Hou, Yaqing, Ma, Wenqiang, Gupta, Abhishek, Bali, Kavitesh Kumar, Ge, Hongwei, Zhang, Qiang, Coello, Carlos A. Coello, and Ong, Yew-Soon
- Subjects
Computer Science - Neural and Evolutionary Computing - Abstract
In recent years, the field of Transfer Evolutionary Optimization (TrEO) has witnessed substantial growth, fueled by the realization of its profound impact on solving complex problems. Numerous algorithms have emerged to address the challenges posed by transferring knowledge between tasks. However, the recently highlighted ``no free lunch theorem'' in transfer optimization clarifies that no single algorithm reigns supreme across diverse problem types. This paper addresses this conundrum by adopting a benchmarking approach to evaluate the performance of various TrEO algorithms in realistic scenarios. Despite the growing methodological focus on transfer optimization, existing benchmark problems often fall short due to inadequate design, predominantly featuring synthetic problems that lack real-world relevance. This paper pioneers a practical TrEO benchmark suite, integrating problems from the literature categorized based on the three essential aspects of Big Source Task-Instances: volume, variety, and velocity. Our primary objective is to provide a comprehensive analysis of existing TrEO algorithms and pave the way for the development of new approaches to tackle practical challenges. By introducing realistic benchmarks that embody the three dimensions of volume, variety, and velocity, we aim to foster a deeper understanding of algorithmic performance in the face of diverse and complex transfer scenarios. This benchmark suite is poised to serve as a valuable resource for researchers, facilitating the refinement and advancement of TrEO algorithms in the pursuit of solving real-world problems., Comment: 17 pages, 18 figures
- Published
- 2024
29. ASID: Active Exploration for System Identification in Robotic Manipulation
- Author
-
Memmel, Marius, Wagenmaker, Andrew, Zhu, Chuning, Yin, Patrick, Fox, Dieter, and Gupta, Abhishek
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Systems and Control - Abstract
Model-free control strategies such as reinforcement learning have shown the ability to learn control strategies without requiring an accurate model or simulator of the world. While this is appealing due to the lack of modeling requirements, such methods can be sample inefficient, making them impractical in many real-world domains. On the other hand, model-based control techniques leveraging accurate simulators can circumvent these challenges and use a large amount of cheap simulation data to learn controllers that can effectively transfer to the real world. The challenge with such model-based techniques is the requirement for an extremely accurate simulation, requiring both the specification of appropriate simulation assets and physical parameters. This requires considerable human effort to design for every environment being considered. In this work, we propose a learning system that can leverage a small amount of real-world data to autonomously refine a simulation model and then plan an accurate control strategy that can be deployed in the real world. Our approach critically relies on utilizing an initial (possibly inaccurate) simulator to design effective exploration policies that, when deployed in the real world, collect high-quality data. We demonstrate the efficacy of this paradigm in identifying articulation, mass, and other physical parameters in several challenging robotic manipulation tasks, and illustrate that only a small amount of real-world data can allow for effective sim-to-real transfer. Project website at https://weirdlabuw.github.io/asid, Comment: Project website at https://weirdlabuw.github.io/asid
- Published
- 2024
30. Damping Reveals Hidden Dimensions in Elastic Metastructures Through Induced Transparency
- Author
-
Fang, Yanghao, Tuxbury, William, Gupta, Abhishek, Kottos, Tsampikos, and Thevamaran, Ramathasan
- Subjects
Physics - Applied Physics ,Condensed Matter - Materials Science ,Condensed Matter - Other Condensed Matter - Abstract
Damping typically results in attenuation of vibrations and elastic wave propagation in mechanical systems. Contrary to this conventional understanding, we demonstrate experimentally and explain theoretically the revival of an elastic wave transmitted through a periodic metastructure when a weak non-Hermitian defect (damping mechanism) induces violation of time-reversal symmetry. Damping alters the nature of the system's resonant modes, instigating interference in the scattering field. This leads to transmission revival, revealing the presence of hidden modes which are otherwise masked by the symmetry. Our findings offer an innovative approach for designing dissipation-driven switches and controllers and non-destructive structural health monitoring systems.
- Published
- 2024
- Full Text
- View/download PDF
31. Multi-Task Learning with Multi-Task Optimization
- Author
-
Bai, Lu, Gupta, Abhishek, and Ong, Yew-Soon
- Subjects
Computer Science - Artificial Intelligence - Abstract
Multi-task learning solves multiple correlated tasks. However, conflicts may exist between them. In such circumstances, a single solution can rarely optimize all the tasks, leading to performance trade-offs. To arrive at a set of optimized yet well-distributed models that collectively embody different trade-offs in one algorithmic pass, this paper proposes to view Pareto multi-task learning through the lens of multi-task optimization. Multi-task learning is first cast as a multi-objective optimization problem, which is then decomposed into a diverse set of unconstrained scalar-valued subproblems. These subproblems are solved jointly using a novel multi-task gradient descent method, whose uniqueness lies in the iterative transfer of model parameters among the subproblems during the course of optimization. A theorem proving faster convergence through the inclusion of such transfers is presented. We investigate the proposed multi-task learning with multi-task optimization for solving various problem settings including image classification, scene understanding, and multi-target regression. Comprehensive experiments confirm that the proposed method significantly advances the state-of-the-art in discovering sets of Pareto-optimized models. Notably, on the large image dataset we tested on, namely NYUv2, the hypervolume convergence achieved by our method was found to be nearly two times faster than the next-best among the state-of-the-art.
- Published
- 2024
32. DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
- Author
-
Khazatsky, Alexander, Pertsch, Karl, Nair, Suraj, Balakrishna, Ashwin, Dasari, Sudeep, Karamcheti, Siddharth, Nasiriany, Soroush, Srirama, Mohan Kumar, Chen, Lawrence Yunliang, Ellis, Kirsty, Fagan, Peter David, Hejna, Joey, Itkina, Masha, Lepert, Marion, Ma, Yecheng Jason, Miller, Patrick Tree, Wu, Jimmy, Belkhale, Suneel, Dass, Shivin, Ha, Huy, Jain, Arhan, Lee, Abraham, Lee, Youngwoon, Memmel, Marius, Park, Sungjae, Radosavovic, Ilija, Wang, Kaiyuan, Zhan, Albert, Black, Kevin, Chi, Cheng, Hatch, Kyle Beltran, Lin, Shan, Lu, Jingpei, Mercat, Jean, Rehman, Abdul, Sanketi, Pannag R, Sharma, Archit, Simpson, Cody, Vuong, Quan, Walke, Homer Rich, Wulfe, Blake, Xiao, Ted, Yang, Jonathan Heewon, Yavary, Arefeh, Zhao, Tony Z., Agia, Christopher, Baijal, Rohan, Castro, Mateo Guaman, Chen, Daphne, Chen, Qiuyu, Chung, Trinity, Drake, Jaimyn, Foster, Ethan Paul, Gao, Jensen, Herrera, David Antonio, Heo, Minho, Hsu, Kyle, Hu, Jiaheng, Jackson, Donovon, Le, Charlotte, Li, Yunshuang, Lin, Kevin, Lin, Roy, Ma, Zehan, Maddukuri, Abhiram, Mirchandani, Suvir, Morton, Daniel, Nguyen, Tony, O'Neill, Abigail, Scalise, Rosario, Seale, Derick, Son, Victor, Tian, Stephen, Tran, Emi, Wang, Andrew E., Wu, Yilin, Xie, Annie, Yang, Jingyun, Yin, Patrick, Zhang, Yunchu, Bastani, Osbert, Berseth, Glen, Bohg, Jeannette, Goldberg, Ken, Gupta, Abhinav, Gupta, Abhishek, Jayaraman, Dinesh, Lim, Joseph J, Malik, Jitendra, Martín-Martín, Roberto, Ramamoorthy, Subramanian, Sadigh, Dorsa, Song, Shuran, Wu, Jiajun, Yip, Michael C., Zhu, Yuke, Kollar, Thomas, Levine, Sergey, and Finn, Chelsea
- Subjects
Computer Science - Robotics - Abstract
The creation of large, diverse, high-quality robot manipulation datasets is an important stepping stone on the path toward more capable and robust robotic manipulation policies. However, creating such datasets is challenging: collecting robot manipulation data in diverse environments poses logistical and safety challenges and requires substantial investments in hardware and human labour. As a result, even the most general robot manipulation policies today are mostly trained on data collected in a small number of environments with limited scene and task diversity. In this work, we introduce DROID (Distributed Robot Interaction Dataset), a diverse robot manipulation dataset with 76k demonstration trajectories or 350 hours of interaction data, collected across 564 scenes and 84 tasks by 50 data collectors in North America, Asia, and Europe over the course of 12 months. We demonstrate that training with DROID leads to policies with higher performance and improved generalization ability. We open source the full dataset, policy learning code, and a detailed guide for reproducing our robot hardware setup., Comment: Project website: https://droid-dataset.github.io/
- Published
- 2024
33. Distributional Successor Features Enable Zero-Shot Policy Optimization
- Author
-
Zhu, Chuning, Wang, Xinqi, Han, Tyler, Du, Simon S., and Gupta, Abhishek
- Subjects
Computer Science - Machine Learning - Abstract
Intelligent agents must be generalists, capable of quickly adapting to various tasks. In reinforcement learning (RL), model-based RL learns a dynamics model of the world, in principle enabling transfer to arbitrary reward functions through planning. However, autoregressive model rollouts suffer from compounding error, making model-based RL ineffective for long-horizon problems. Successor features offer an alternative by modeling a policy's long-term state occupancy, reducing policy evaluation under new rewards to linear regression. Yet, zero-shot policy optimization for new tasks with successor features can be challenging. This work proposes a novel class of models, i.e., Distributional Successor Features for Zero-Shot Policy Optimization (DiSPOs), that learn a distribution of successor features of a stationary dataset's behavior policy, along with a policy that acts to realize different successor features achievable within the dataset. By directly modeling long-term outcomes in the dataset, DiSPOs avoid compounding error while enabling a simple scheme for zero-shot policy optimization across reward functions. We present a practical instantiation of DiSPOs using diffusion models and show their efficacy as a new class of transferable models, both theoretically and empirically across various simulated robotics problems. Videos and code available at https://weirdlabuw.github.io/dispo/.
- Published
- 2024
34. Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation
- Author
-
Torne, Marcel, Simeonov, Anthony, Li, Zechu, Chan, April, Chen, Tao, Gupta, Abhishek, and Agrawal, Pulkit
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Imitation learning methods need significant human supervision to learn policies robust to changes in object poses, physical disturbances, and visual distractors. Reinforcement learning, on the other hand, can explore the environment autonomously to learn robust behaviors but may require impractical amounts of unsafe real-world data collection. To learn performant, robust policies without the burden of unsafe real-world data collection or extensive human supervision, we propose RialTo, a system for robustifying real-world imitation learning policies via reinforcement learning in "digital twin" simulation environments constructed on the fly from small amounts of real-world data. To enable this real-to-sim-to-real pipeline, RialTo proposes an easy-to-use interface for quickly scanning and constructing digital twins of real-world environments. We also introduce a novel "inverse distillation" procedure for bringing real-world demonstrations into simulated environments for efficient fine-tuning, with minimal human intervention and engineering required. We evaluate RialTo across a variety of robotic manipulation problems in the real world, such as robustly stacking dishes on a rack, placing books on a shelf, and six other tasks. RialTo increases (over 67%) in policy robustness without requiring extensive human data collection. Project website and videos at https://real-to-sim-to-real.github.io/RialTo/, Comment: Project page: https://real-to-sim-to-real.github.io/RialTo/
- Published
- 2024
35. Ophthalmoplegia with ophthalmic artery occlusion as initial manifestation of Mucormycosis: a photo essay
- Author
-
Gupta, Abhishek, Dutta, Mainak, Priya, Sweksha, Maiti, Mousam, Haokip, Grace Lhinghoithem, Singh, Prabhakar, Choudhary, Anjali, Banerjee, Prabrisha, Gupta, Richa, and Chattopadhyay, Mahuya
- Published
- 2024
- Full Text
- View/download PDF
36. Retinal haemorrhages as initial manifestation of leukaemia: a photo essay
- Author
-
Gupta, Abhishek, Gupta, Richa, Singh, Prabhakar, Banerjee, Prabrisha, Priya, Sweksha, Ghosh, Anusuya, Agrawal, Smriti, Kumari, Sandhya, Chakraborty, Koyel, Chattopadhyay, Mahuya, and Sinha, Shivani
- Published
- 2024
- Full Text
- View/download PDF
37. Spatial Heterogeneity in Health Risk Assessment of Heavy Metals During North-East Monsoon and South-West Monsoon over India
- Author
-
Rani, Martina, Ahlawat, Sakshi, Vijayan, N., Yadav, Lokesh, Banerjee, Tirthankar, Chatterjee, Abhijeet, Bhatti, Manpreet Singh, Das, Trupti, Dhir, Amit, Goel, Sangita, Khan, Altaf Husain, Khaiwal, Ravindra, Kuniyal, Jagdish Chandra, Lakhani, Anita, Gupta, Abhishek, Pal, Srimanta, Saikia, Prasenjit, Vyas, B. M., Mor, Suman, and Mandal, Tuhin Kumar
- Published
- 2024
- Full Text
- View/download PDF
38. Automated segmentation of acute leukemia using blood and bone marrow smear images: a systematic review
- Author
-
Raina, Rohini, Gondhi, Naveen Kumar, and Gupta, Abhishek
- Published
- 2024
- Full Text
- View/download PDF
39. Sun-powered synthesis: harnessing multiwall carbon nanotube-EB photocatalytic magic in a unified photocatalytic-biocatalytic system for solar- driven L-glutamate production from ɑ-ketoglutarate
- Author
-
Mishra, Abhishek, Yadav, Rajesh K., Mishra, Shaifali, Shahin, Rehana, Singh, Satyam, Gupta, Abhishek Kumar, Singhal, Rajat, Gupta, Navneet K., Baeg, Jin-OoK, El-Hiti, Gamal A., Yadav, Krishna Kumar, and Singh, Sunita
- Published
- 2024
- Full Text
- View/download PDF
40. A novel validated instrumental thin-layer chromatographic method and marker-based standardization of liquid herbal formulation using atropine, rutin and vanillin as biomarkers
- Author
-
Varma, Vibha R., Gupta, Abhishek A., and Dhande, Swati R.
- Published
- 2024
- Full Text
- View/download PDF
41. SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
- Author
-
Luo, Jianlan, Hu, Zheyuan, Xu, Charles, Tan, You Liang, Berg, Jacob, Sharma, Archit, Schaal, Stefan, Finn, Chelsea, Gupta, Abhishek, and Levine, Sergey
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence - Abstract
In recent years, significant progress has been made in the field of robotic reinforcement learning (RL), enabling methods that handle complex image observations, train in the real world, and incorporate auxiliary data, such as demonstrations and prior experience. However, despite these advances, robotic RL remains hard to use. It is acknowledged among practitioners that the particular implementation details of these algorithms are often just as important (if not more so) for performance as the choice of algorithm. We posit that a significant challenge to widespread adoption of robotic RL, as well as further development of robotic RL methods, is the comparative inaccessibility of such methods. To address this challenge, we developed a carefully implemented library containing a sample efficient off-policy deep RL method, together with methods for computing rewards and resetting the environment, a high-quality controller for a widely-adopted robot, and a number of challenging example tasks. We provide this library as a resource for the community, describe its design choices, and present experimental results. Perhaps surprisingly, we find that our implementation can achieve very efficient learning, acquiring policies for PCB board assembly, cable routing, and object relocation between 25 to 50 minutes of training per policy on average, improving over state-of-the-art results reported for similar tasks in the literature. These policies achieve perfect or near-perfect success rates, extreme robustness even under perturbations, and exhibit emergent recovery and correction behaviors. We hope that these promising results and our high-quality open-source implementation will provide a tool for the robotics community to facilitate further developments in robotic RL. Our code, documentation, and videos can be found at https://serl-robot.github.io/, Comment: ICRA 2024
- Published
- 2024
42. Dynamic Capital Requirements for Markov Decision Processes
- Author
-
Haskell, William B., Gupta, Abhishek, and Shao, Shiping
- Subjects
Mathematics - Optimization and Control - Abstract
We build on the theory of capital requirements (CRs) to create a new framework for modeling dynamic risk preferences. The key question is how to evaluate the risk of a payoff stream sequentially as new information is revealed. In our model, we associate each payoff stream with a disbursement strategy and a premium schedule to form a triple of stochastic processes. We characterize risk preferences in terms of a single set that we call the risk frontier which characterizes acceptable triples. We then propose the generalized capital requirement (GCR) which evaluates the risk of a payoff stream by minimizing the premium schedule over acceptable triples. We apply this model to a risk-aware decision maker (DM) who controls a Markov decision process (MDP) and wants to find a policy to minimize the GCR of its payoff stream. The resulting GCR-MDP recovers many well-known risk-aware MDPs as special cases. To make this approach computationally viable, we obtain the temporal decomposition of the GCR in terms of the risk frontier. Then, we connect the temporal decomposition with the notion of an information state to compactly capture the dependence of DM's risk preferences on the problem history, where augmented dynamic programming can be used to compute an optimal policy. We report numerical experiments for the GCR-minimizing newsvendor.
- Published
- 2024
43. On the Target Detection Performance of a Molecular Communication Network with Multiple Mobile Nanomachines
- Author
-
Sabu, Nithin V. and Gupta, Abhishek K.
- Subjects
Computer Science - Information Theory ,Electrical Engineering and Systems Science - Signal Processing - Abstract
A network of nanomachines (NMs) can be used to build a target detection system for a variety of promising applications. They have the potential to detect toxic chemicals, infectious bacteria, and biomarkers of dangerous diseases such as cancer within the human body. Many diseases and health disorders can be detected early and efficiently treated in the future by utilizing these systems. To fully grasp the potential of these systems, mathematical analysis is required. This paper describes an analytical framework for modeling and analyzing the performance of target detection systems composed of multiple mobile nanomachines of varying sizes with passive/absorbing boundaries. We consider both direct contact detection, in which NMs must physically contact the target to detect it, and indirect sensing, in which NMs must detect the marker molecules emitted by the target. The detection performance of such systems is calculated for degradable and non-degradable targets, as well as mobile and stationary targets. The derived expressions provide various insights, such as the effect of NM density and target degradation on detection probability.
- Published
- 2024
44. Multiform Evolution for High-Dimensional Problems with Low Effective Dimensionality
- Author
-
Hou, Yaqing, Sun, Mingyang, Gupta, Abhishek, Jin, Yaochu, Piao, Haiyin, Ge, Hongwei, and Zhang, Qiang
- Subjects
Computer Science - Neural and Evolutionary Computing - Abstract
In this paper, we scale evolutionary algorithms to high-dimensional optimization problems that deceptively possess a low effective dimensionality (certain dimensions do not significantly affect the objective function). To this end, an instantiation of the multiform optimization paradigm is presented, where multiple low-dimensional counterparts of a target high-dimensional task are generated via random embeddings. Since the exact relationship between the auxiliary (low-dimensional) tasks and the target is a priori unknown, a multiform evolutionary algorithm is developed for unifying all formulations into a single multi-task setting. The resultant joint optimization enables the target task to efficiently reuse solutions evolved across various low-dimensional searches via cross-form genetic transfers, hence speeding up overall convergence characteristics. To validate the overall efficacy of our proposed algorithmic framework, comprehensive experimental studies are carried out on well-known continuous benchmark functions as well as a set of practical problems in the hyper-parameter tuning of machine learning models and deep learning models in classification tasks and Predator-Prey games, respectively., Comment: 12 pages,6 figures
- Published
- 2023
45. Bayesian Inverse Transfer in Evolutionary Multiobjective Optimization
- Author
-
Liu, Jiao, Gupta, Abhishek, and Ong, Yew-Soon
- Subjects
Computer Science - Neural and Evolutionary Computing - Abstract
Transfer optimization enables data-efficient optimization of a target task by leveraging experiential priors from related source tasks. This is especially useful in multiobjective optimization settings where a set of trade-off solutions is sought under tight evaluation budgets. In this paper, we introduce a novel concept of \textit{inverse transfer} in multiobjective optimization. Inverse transfer stands out by employing Bayesian inverse Gaussian process models to map performance vectors in the objective space to population search distributions in task-specific decision space, facilitating knowledge transfer through objective space unification. Building upon this idea, we introduce the first Inverse Transfer Evolutionary Multiobjective Optimizer (invTrEMO). A key highlight of invTrEMO is its ability to harness the common objective functions prevalent in many application areas, even when decision spaces do not precisely align between tasks. This allows invTrEMO to uniquely and effectively utilize information from heterogeneous source tasks as well. Furthermore, invTrEMO yields high-precision inverse models as a significant byproduct, enabling the generation of tailored solutions on-demand based on user preferences. Empirical studies on multi- and many-objective benchmark problems, as well as a practical case study, showcase the faster convergence rate and modelling accuracy of the invTrEMO relative to state-of-the-art evolutionary and Bayesian optimization algorithms. The source code of the invTrEMO is made available at https://github.com/LiuJ-2023/invTrEMO.
- Published
- 2023
46. Model-Free Change Point Detection for Mixing Processes
- Author
-
Chen, Hao, Gupta, Abhishek, Sun, Yin, and Shroff, Ness
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
This paper considers the change point detection problem under dependent samples. In particular, we provide performance guarantees for the MMD-CUSUM test under exponentially $\alpha$, $\beta$, and fast $\phi$-mixing processes, which significantly expands its utility beyond the i.i.d. and Markovian cases used in previous studies. We obtain lower bounds for average-run-length (ARL) and upper bounds for average-detection-delay (ADD) in terms of the threshold parameter. We show that the MMD-CUSUM test enjoys the same level of performance as the i.i.d. case under fast $\phi$-mixing processes. The MMD-CUSUM test also achieves strong performance under exponentially $\alpha$/$\beta$-mixing processes, which are significantly more relaxed than existing results. The MMD-CUSUM test statistic adapts to different settings without modifications, rendering it a completely data-driven, dependence-agnostic change point detection scheme. Numerical simulations are provided at the end to evaluate our findings., Comment: 20 pages, 4 figures. Accepted by IEEE OJ-CSYS
- Published
- 2023
- Full Text
- View/download PDF
47. Modeling Boundedly Rational Agents with Latent Inference Budgets
- Author
-
Jacob, Athul Paul, Gupta, Abhishek, and Andreas, Jacob
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
We study the problem of modeling a population of agents pursuing unknown goals subject to unknown computational constraints. In standard models of bounded rationality, sub-optimal decision-making is simulated by adding homoscedastic noise to optimal decisions rather than explicitly simulating constrained inference. In this work, we introduce a latent inference budget model (L-IBM) that models agents' computational constraints explicitly, via a latent variable (inferred jointly with a model of agents' goals) that controls the runtime of an iterative inference algorithm. L-IBMs make it possible to learn agent models using data from diverse populations of suboptimal actors. In three modeling tasks -- inferring navigation goals from routes, inferring communicative intents from human utterances, and predicting next moves in human chess games -- we show that L-IBMs match or outperform Boltzmann models of decision-making under uncertainty. Inferred inference budgets are themselves meaningful, efficient to compute, and correlated with measures of player skill, partner skill and task difficulty.
- Published
- 2023
48. The Baldwin Effect in Advancing Generalizability of Physics-Informed Neural Networks
- Author
-
Wong, Jian Cheng, Ooi, Chin Chun, Gupta, Abhishek, Chiu, Pao-Hsiung, Low, Joshua Shao Zheng, Dao, My Ha, and Ong, Yew-Soon
- Subjects
Computer Science - Neural and Evolutionary Computing ,Computer Science - Computational Engineering, Finance, and Science ,Computer Science - Machine Learning - Abstract
Physics-informed neural networks (PINNs) are at the forefront of scientific machine learning, making possible the creation of machine intelligence that is cognizant of physical laws and able to accurately simulate them. However, today's PINNs are often trained for a single physics task and require computationally expensive re-training for each new task, even for tasks from similar physics domains. To address this limitation, this paper proposes a pioneering approach to advance the generalizability of PINNs through the framework of Baldwinian evolution. Drawing inspiration from the neurodevelopment of precocial species that have evolved to learn, predict and react quickly to their environment, we envision PINNs that are pre-wired with connection strengths inducing strong biases towards efficient learning of physics. A novel two-stage stochastic programming formulation coupling evolutionary selection pressure (based on proficiency over a distribution of physics tasks) with lifetime learning (to specialize on a sampled subset of those tasks) is proposed to instantiate the Baldwin effect. The evolved Baldwinian-PINNs demonstrate fast and physics-compliant prediction capabilities across a range of empirically challenging problem instances with more than an order of magnitude improvement in prediction accuracy at a fraction of the computation cost compared to state-of-the-art gradient-based meta-learning methods. For example, when solving the diffusion-reaction equation, a 70x improvement in accuracy was obtained while taking 700x less computational time. This paper thus marks a leap forward in the meta-learning of PINNs as generalizable physics solvers. Sample codes are available at \url{https://github.com/chiuph/Baldwinian-PINN}.
- Published
- 2023
49. New Epochs in AI Supervision: Design and Implementation of an Autonomous Radiology AI Monitoring System
- Author
-
Venugopal, Vasantha Kumar, Gupta, Abhishek, Takhar, Rohit, and Mahajan, Vidur
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
With the increasingly widespread adoption of AI in healthcare, maintaining the accuracy and reliability of AI models in clinical practice has become crucial. In this context, we introduce novel methods for monitoring the performance of radiology AI classification models in practice, addressing the challenges of obtaining real-time ground truth for performance monitoring. We propose two metrics - predictive divergence and temporal stability - to be used for preemptive alerts of AI performance changes. Predictive divergence, measured using Kullback-Leibler and Jensen-Shannon divergences, evaluates model accuracy by comparing predictions with those of two supplementary models. Temporal stability is assessed through a comparison of current predictions against historical moving averages, identifying potential model decay or data drift. This approach was retrospectively validated using chest X-ray data from a single-center imaging clinic, demonstrating its effectiveness in maintaining AI model reliability. By providing continuous, real-time insights into model performance, our system ensures the safe and effective use of AI in clinical decision-making, paving the way for more robust AI integration in healthcare, Comment: 10 pages, 4 figures, 2 tables
- Published
- 2023
50. Evolution-guided Bayesian optimization for constrained multi-objective optimization in self-driving labs
- Author
-
Low, Andre K. Y., Mekki-Berrada, Flore, Gupta, Abhishek, Ostudin, Aleksandr, Xie, Jiaxun, Vissol-Gaudin, Eleonore, Lim, Yee-Fun, Li, Qianxiao, Ong, Yew Soon, Khan, Saif A., and Hippalgaonkar, Kedar
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.