Author: "Chaudhury, P." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chaudhury, P."' showing total 3,611 results

Start Over Author "Chaudhury, P."

3,611 results on '"Chaudhury, P."'

1. Large Language Models can be Strong Self-Detoxifiers

Author: Ko, Ching-Yun, Chen, Pin-Yu, Das, Payel, Mroueh, Youssef, Dan, Soham, Kollias, Georgios, Chaudhury, Subhajit, Pedapati, Tejaswini, and Daniel, Luca
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Reducing the likelihood of generating harmful and toxic output is an essential task when aligning large language models (LLMs). Existing methods mainly rely on training an external reward model (i.e., another language model) or fine-tuning the LLM using self-generated data to influence the outcome. In this paper, we show that LLMs have the capability of self-detoxification without the use of an additional reward model or re-training. We propose \textit{Self-disciplined Autoregressive Sampling (SASA)}, a lightweight controlled decoding algorithm for toxicity reduction of LLMs. SASA leverages the contextual representations from an LLM to learn linear subspaces characterizing toxic v.s. non-toxic output in analytical forms. When auto-completing a response token-by-token, SASA dynamically tracks the margin of the current output to steer the generation away from the toxic subspace, by adjusting the autoregressive sampling strategy. Evaluated on LLMs of different scale and nature, namely Llama-3.1-Instruct (8B), Llama-2 (7B), and GPT2-L models with the RealToxicityPrompts, BOLD, and AttaQ benchmarks, SASA markedly enhances the quality of the generated sentences relative to the original models and attains comparable performance to state-of-the-art detoxification techniques, significantly reducing the toxicity level by only using the LLM's internal representations., Comment: 20 pages
Published: 2024

2. Survival Prediction in Lung Cancer through Multi-Modal Representation Learning

Author: Farooq, Aiman, Mishra, Deepak, and Chaudhury, Santanu
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Survival prediction is a crucial task associated with cancer diagnosis and treatment planning. This paper presents a novel approach to survival prediction by harnessing comprehensive information from CT and PET scans, along with associated Genomic data. Current methods rely on either a single modality or the integration of multiple modalities for prediction without adequately addressing associations across patients or modalities. We aim to develop a robust predictive model for survival outcomes by integrating multi-modal imaging data with genetic information while accounting for associations across patients and modalities. We learn representations for each modality via a self-supervised module and harness the semantic similarities across the patients to ensure the embeddings are aligned closely. However, optimizing solely for global relevance is inadequate, as many pairs sharing similar high-level semantics, such as tumor type, are inadvertently pushed apart in the embedding space. To address this issue, we use a cross-patient module (CPM) designed to harness inter-subject correspondences. The CPM module aims to bring together embeddings from patients with similar disease characteristics. Our experimental evaluation of the dataset of Non-Small Cell Lung Cancer (NSCLC) patients demonstrates the effectiveness of our approach in predicting survival outcomes, outperforming state-of-the-art methods., Comment: Accepted in WACV 2025
Published: 2024

3. Incorporating dense metric depth into neural 3D representations for view synthesis and relighting

Author: Chaudhury, Arkadeep Narayan, Vasiljevic, Igor, Zakharov, Sergey, Guizilini, Vitor, Ambrus, Rares, Narasimhan, Srinivasa, and Atkeson, Christopher G.
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Graphics, Computer Science - Robotics
Abstract: Synthesizing accurate geometry and photo-realistic appearance of small scenes is an active area of research with compelling use cases in gaming, virtual reality, robotic-manipulation, autonomous driving, convenient product capture, and consumer-level photography. When applying scene geometry and appearance estimation techniques to robotics, we found that the narrow cone of possible viewpoints due to the limited range of robot motion and scene clutter caused current estimation techniques to produce poor quality estimates or even fail. On the other hand, in robotic applications, dense metric depth can often be measured directly using stereo and illumination can be controlled. Depth can provide a good initial estimate of the object geometry to improve reconstruction, while multi-illumination images can facilitate relighting. In this work we demonstrate a method to incorporate dense metric depth into the training of neural 3D representations and address an artifact observed while jointly refining geometry and appearance by disambiguating between texture and geometry edges. We also discuss a multi-flash stereo camera system developed to capture the necessary data for our pipeline and show results on relighting and view synthesis with a few training views., Comment: Project webpage: https://stereomfc.github.io
Published: 2024

4. Translating Imaging to Genomics: Leveraging Transformers for Predictive Modeling

Author: Farooq, Aiman, Mishra, Deepak, and Chaudhury, Santanu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In this study, we present a novel approach for predicting genomic information from medical imaging modalities using a transformer-based model. We aim to bridge the gap between imaging and genomics data by leveraging transformer networks, allowing for accurate genomic profile predictions from CT/MRI images. Presently most studies rely on the use of whole slide images (WSI) for the association, which are obtained via invasive methodologies. We propose using only available CT/MRI images to predict genomic sequences. Our transformer based approach is able to efficiently generate associations between multiple sequences based on CT/MRI images alone. This work paves the way for the use of non-invasive imaging modalities for precise and personalized healthcare, allowing for a better understanding of diseases and treatment.
Published: 2024

5. Generation Constraint Scaling Can Mitigate Hallucination

Author: Kollias, Georgios, Das, Payel, and Chaudhury, Subhajit
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Addressing the issue of hallucinations in large language models (LLMs) is a critical challenge. As the cognitive mechanisms of hallucination have been related to memory, here we explore hallucination for LLM that is enabled with explicit memory mechanisms. We empirically demonstrate that by simply scaling the readout vector that constrains generation in a memory-augmented LLM decoder, hallucination mitigation can be achieved in a training-free manner. Our method is geometry-inspired and outperforms a state-of-the-art LLM editing method on the task of generation of Wikipedia-like biography entries both in terms of generation quality and runtime complexity., Comment: 7 pages; accepted at ICML 2024 Workshop on Large Language Models and Cognition
Published: 2024

6. Micro-Ring Modulator Linearity Enhancement for Analog and Digital Optical Links

Author: Chaudhury, Sumilak, Johnson, Karl, Gao, Chengkuan, Lin, Bill, Fainman, Yeshaiahu, and Hsueh, Tzu-Chien
Subjects: Electrical Engineering and Systems Science - Systems and Control, Physics - Optics
Abstract: An energy/area-efficient low-cost broadband linearity enhancement technique for electro-optic micro-ring modulators (MRM) is proposed to achieve 6.1-dB dynamic linearity improvement in spurious-free-dynamic-range with intermodulation distortions (IMD) and 17.9-dB static linearity improvement in integral nonlinearity over a conventional notch-filter MRM within a 4.8-dB extinction-ratio (ER) full-scale range based on rapid silicon-photonics fabrication results for the emerging applications of various analog and digital optical communication systems., Comment: 4 pages, 5 figures
Published: 2024

7. Needle in the Haystack for Memory Based Large Language Models

Author: Nelson, Elliot, Kollias, Georgios, Das, Payel, Chaudhury, Subhajit, and Dan, Soham
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Current large language models (LLMs) often perform poorly on simple fact retrieval tasks. Here we investigate if coupling a dynamically adaptable external memory to a LLM can alleviate this problem. For this purpose, we test Larimar, a recently proposed language model architecture which uses an external associative memory, on long-context recall tasks including passkey and needle-in-the-haystack tests. We demonstrate that the external memory of Larimar, which allows fast write and read of an episode of text samples, can be used at test time to handle contexts much longer than those seen during training. We further show that the latent readouts from the memory (to which long contexts are written) control the decoder towards generating correct outputs, with the memory stored off of the GPU. Compared to existing transformer-based LLM architectures for long-context recall tasks that use larger parameter counts or modified attention mechanisms, a relatively smaller size Larimar is able to maintain strong performance without any task-specific training or training on longer contexts., Comment: 5 pages; slightly revised abstract
Published: 2024

8. TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes

Author: Khatiwada, Aamod, Kokel, Harsha, Abdelaziz, Ibrahim, Chaudhury, Subhajit, Dolby, Julian, Hassanzadeh, Oktie, Huang, Zhenhan, Pedapati, Tejaswini, Samulowitz, Horst, and Srinivas, Kavitha
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Databases
Abstract: Enterprises have a growing need to identify relevant tables in data lakes; e.g. tables that are unionable, joinable, or subsets of each other. Tabular neural models can be helpful for such data discovery tasks. In this paper, we present TabSketchFM, a neural tabular model for data discovery over data lakes. First, we propose novel pre-training: a sketch-based approach to enhance the effectiveness of data discovery in neural tabular models. Second, we finetune the pretrained model for identifying unionable, joinable, and subset table pairs and show significant improvement over previous tabular neural models. Third, we present a detailed ablation study to highlight which sketches are crucial for which tasks. Fourth, we use these finetuned models to perform table search; i.e., given a query table, find other tables in a corpus that are unionable, joinable, or that are subsets of the query. Our results demonstrate significant improvements in F1 scores for search compared to state-of-the-art techniques. Finally, we show significant transfer across datasets and tasks establishing that our model can generalize across different tasks and over different data lakes.
Published: 2024

9. Differentiating Prodromal Dementia with Lewy Bodies from Prodromal Alzheimers Disease: A Pragmatic Review for Clinicians.

Author: Wyman-Chick, Kathryn, Chaudhury, Parichita, Abdelnour, Carla, Matar, Elie, Chiu, Shannon, Ferreira, Daniel, Hamilton, Calum, Donaghy, Paul, Rodriguez-Porcel, Federico, Toledo, Jon, Habich, Annegret, Barrett, Matthew, Patel, Bhavana, Jaramillo-Jimenez, Alberto, Scott, Gregory, Kane, Joseph, and Bayram, Ece
Subjects: Biomarkers, Clinical diagnosis, Early-stage dementia, Mild cognitive impairment, Neuropsychological profile, Psychiatric symptoms, Treatment planning
Abstract: This pragmatic review synthesises the current understanding of prodromal dementia with Lewy bodies (pDLB) and prodromal Alzheimers disease (pAD), including clinical presentations, neuropsychological profiles, neuropsychiatric symptoms, biomarkers, and indications for disease management. The core clinical features of dementia with Lewy bodies (DLB)-parkinsonism, complex visual hallucinations, cognitive fluctuations, and REM sleep behaviour disorder are common prodromal symptoms. Supportive clinical features of pDLB include severe neuroleptic sensitivity, as well as autonomic and neuropsychiatric symptoms. The neuropsychological profile in mild cognitive impairment attributable to Lewy body pathology (MCI-LB) tends to include impairment in visuospatial skills and executive functioning, distinguishing it from MCI due to AD, which typically presents with impairment in memory. pDLB may present with cognitive impairment, psychiatric symptoms, and/or recurrent episodes of delirium, indicating that it is not necessarily synonymous with MCI-LB. Imaging, fluid and other biomarkers may play a crucial role in differentiating pDLB from pAD. The current MCI-LB criteria recognise low dopamine transporter uptake using positron emission tomography or single photon emission computed tomography (SPECT), loss of REM atonia on polysomnography, and sympathetic cardiac denervation using meta-iodobenzylguanidine SPECT as indicative biomarkers with slowing of dominant frequency on EEG among others as supportive biomarkers. This review also highlights the emergence of fluid and skin-based biomarkers. There is little research evidence for the treatment of pDLB, but pharmacological and non-pharmacological treatments for DLB may be discussed with patients. Non-pharmacological interventions such as diet, exercise, and cognitive stimulation may provide benefit, while evaluation and management of contributing factors like medications and sleep disturbances are vital. There is a need to expand research across diverse patient populations to address existing disparities in clinical trial participation. In conclusion, an early and accurate diagnosis of pDLB or pAD presents an opportunity for tailored interventions, improved healthcare outcomes, and enhanced quality of life for patients and care partners.
Published: 2024

10. Alpha rhythm slowing in temporal epilepsy across Scalp EEG and MEG

Author: Janiukstyte, Vytene, Kozma, Csaba, Owen, Thomas W., Chaudhury, Umair J, Diehl, Beate, Lemieux, Louis, Duncan, John S, Rugg-Gunn, Fergus, de Tisi, Jane, Wang, Yujiang, and Taylor, Peter N.
Subjects: Quantitative Biology - Neurons and Cognition
Abstract: EEG slowing is reported in various neurological disorders including Alzheimer's, Parkinson's and Epilepsy. Here, we investigate alpha rhythm slowing in individuals with refractory temporal lobe epilepsy (TLE), compared to healthy controls, using scalp electroencephalography (EEG) and magnetoencephalography (MEG). We retrospectively analysed data from 17,(46) healthy controls and 22,(24) individuals with TLE who underwent scalp EEG and (MEG) recordings as part of presurgical evaluation. Resting-state, eyes-closed recordings were source reconstructed using the standardized low-resolution brain electrographic tomography (sLORETA) method. We extracted low (slow) 6-9 Hz and high (fast) 10-11 Hz alpha relative band power and calculated the alpha power ratio by dividing low (slow) alpha by high (fast) alpha. This ratio was computed for all brain regions in all individuals. Alpha oscillations were slower in individuals with TLE than controls (p<0.05). This effect was present in both the ipsilateral and contralateral hemispheres, and across widespread brain regions. Alpha slowing in TLE was found in both EEG and MEG recordings. We interpret greater low (slow)-alpha as greater deviation from health.
Published: 2024

11. On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

Author: Gruppi, Mauricio, Dan, Soham, Murugesan, Keerthiram, and Chaudhury, Subhajit
Subjects: Computer Science - Computation and Language
Abstract: Text-based reinforcement learning involves an agent interacting with a fictional environment using observed text and admissible actions in natural language to complete a task. Previous works have shown that agents can succeed in text-based interactive environments even in the complete absence of semantic understanding or other linguistic capabilities. The success of these agents in playing such games suggests that semantic understanding may not be important for the task. This raises an important question about the benefits of LMs in guiding the agents through the game states. In this work, we show that rich semantic understanding leads to efficient training of text-based RL agents. Moreover, we describe the occurrence of semantic degeneration as a consequence of inappropriate fine-tuning of language models in text-based reinforcement learning (TBRL). Specifically, we describe the shift in the semantic representation of words in the LM, as well as how it affects the performance of the agent in tasks that are semantically similar to the training games. We believe these results may help develop better strategies to fine-tune agents in text-based RL scenarios.
Published: 2024

12. Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation

Author: Chaudhury, Rohan, Godbole, Mihir, Garg, Aakash, and Seo, Jinsil Hwaryoung
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction
Abstract: Contemporary conversational systems often present a significant limitation: their responses lack the emotional depth and disfluent characteristic of human interactions. This absence becomes particularly noticeable when users seek more personalized and empathetic interactions. Consequently, this makes them seem mechanical and less relatable to human users. Recognizing this gap, we embarked on a journey to humanize machine communication, to ensure AI systems not only comprehend but also resonate. To address this shortcoming, we have designed an innovative speech synthesis pipeline. Within this framework, a cutting-edge language model introduces both human-like emotion and disfluencies in a zero-shot setting. These intricacies are seamlessly integrated into the generated text by the language model during text generation, allowing the system to mirror human speech patterns better, promoting more intuitive and natural user interactions. These generated elements are then adeptly transformed into corresponding speech patterns and emotive sounds using a rule-based approach during the text-to-speech phase. Based on our experiments, our novel system produces synthesized speech that's almost indistinguishable from genuine human communication, making each interaction feel more personal and authentic., Comment: 10 pages, 1 figure, for associated code and media files, see https://github.com/Rohan-Chaudhury/Humane-Speech-Synthesis-through-Zero-Shot-Emotion-and-Disfluency-Generation
Published: 2024

13. Larimar: Large Language Models with Episodic Memory Control

Author: Das, Payel, Chaudhury, Subhajit, Nelson, Elliot, Melnyk, Igor, Swaminathan, Sarath, Dai, Sihui, Lozano, Aurélie, Kollias, Georgios, Chenthamarakshan, Vijil, Jiří, Navrátil, Dan, Soham, and Chen, Pin-Yu
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Efficient and accurate updating of knowledge stored in Large Language Models (LLMs) is one of the most pressing research challenges today. This paper presents Larimar - a novel, brain-inspired architecture for enhancing LLMs with a distributed episodic memory. Larimar's memory allows for dynamic, one-shot updates of knowledge without the need for computationally expensive re-training or fine-tuning. Experimental results on multiple fact editing benchmarks demonstrate that Larimar attains accuracy comparable to most competitive baselines, even in the challenging sequential editing setup, but also excels in speed - yielding speed-ups of 8-10x depending on the base LLM - as well as flexibility due to the proposed architecture being simple, LLM-agnostic, and hence general. We further provide mechanisms for selective fact forgetting, information leakage prevention, and input context length generalization with Larimar and show their effectiveness. Our code is available at https://github.com/IBM/larimar, Comment: ICML 2024
Published: 2024

14. EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

Author: Basu, Kinjal, Murugesan, Keerthiram, Chaudhury, Subhajit, Campbell, Murray, Talamadupula, Kartik, and Klinger, Tim
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Logic in Computer Science
Abstract: Text-based games (TBGs) have emerged as an important collection of NLP tasks, requiring reinforcement learning (RL) agents to combine natural language understanding with reasoning. A key challenge for agents attempting to solve such tasks is to generalize across multiple games and demonstrate good performance on both seen and unseen objects. Purely deep-RL-based approaches may perform well on seen objects; however, they fail to showcase the same performance on unseen objects. Commonsense-infused deep-RL agents may work better on unseen data; unfortunately, their policies are often not interpretable or easily transferable. To tackle these issues, in this paper, we present EXPLORER which is an exploration-guided reasoning agent for textual reinforcement learning. EXPLORER is neurosymbolic in nature, as it relies on a neural module for exploration and a symbolic module for exploitation. It can also learn generalized symbolic policies and perform well over unseen data. Our experiments show that EXPLORER outperforms the baseline agents on Text-World cooking (TW-Cooking) and Text-World Commonsense (TWC) games.
Published: 2024

15. Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Author: Achintalwar, Swapnaja, Garcia, Adriana Alvarado, Anaby-Tavor, Ateret, Baldini, Ioana, Berger, Sara E., Bhattacharjee, Bishwaranjan, Bouneffouf, Djallel, Chaudhury, Subhajit, Chen, Pin-Yu, Chiazor, Lamogha, Daly, Elizabeth M., DB, Kirushikesh, de Paula, Rogério Abreu, Dognin, Pierre, Farchi, Eitan, Ghosh, Soumya, Hind, Michael, Horesh, Raya, Kour, George, Lee, Ja Young, Madaan, Nishtha, Mehta, Sameep, Miehling, Erik, Murugesan, Keerthiram, Nagireddy, Manish, Padhi, Inkit, Piorkowski, David, Rawat, Ambrish, Raz, Orna, Sattigeri, Prasanna, Strobelt, Hendrik, Swaminathan, Sarathkrishna, Tillmann, Christoph, Trivedi, Aashka, Varshney, Kush R., Wei, Dennis, Witherspooon, Shalisha, and Zalmanovici, Marcel
Subjects: Computer Science - Machine Learning
Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we present our ongoing efforts to create and deploy a library of detectors: compact and easy-to-build classification models that provide labels for various harms. In addition to the detectors themselves, we discuss a wide range of uses for these detector models - from acting as guardrails to enabling effective AI governance. We also deep dive into inherent challenges in their development and discuss future work aimed at making the detectors more reliable and broadening their scope.
Published: 2024

16. CONCORD: enhancing COVID-19 research with weak-supervision based numerical claim extraction

Author: Shah, Dhwanil, Shah, Krish, Jagani, Manan, Shah, Agam, and Chaudhury, Bhaskar
Published: 2024
Full Text: View/download PDF

17. API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs

Author: Basu, Kinjal, Abdelaziz, Ibrahim, Chaudhury, Subhajit, Dan, Soham, Crouse, Maxwell, Munawar, Asim, Kumaravel, Sadhana, Muthusamy, Vinod, Kapanipathi, Pavan, and Lastras, Luis A.
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks. As such, there is tremendous interest in methods that can acquire sufficient quantities of train and test data that involve calls to tools / APIs. Two lines of research have emerged as the predominant strategies for addressing this challenge. The first has focused on synthetic data generation techniques, while the second has involved curating task-adjacent datasets which can be transformed into API / Tool-based tasks. In this paper, we focus on the task of identifying, curating, and transforming existing datasets and, in turn, introduce API-BLEND, a large corpora for training and systematic testing of tool-augmented LLMs. The datasets mimic real-world scenarios involving API-tasks such as API / tool detection, slot filling, and sequencing of the detected APIs. We demonstrate the utility of the API-BLEND dataset for both training and benchmarking purposes., Comment: Accepted at ACL'24-main conference
Published: 2024

18. Numerical Claim Detection in Finance: A New Financial Dataset, Weak-Supervision Model, and Market Analysis

Author: Shah, Agam, Hiray, Arnav, Shah, Pratvi, Banerjee, Arkaprabha, Singh, Anushka, Eidnani, Dheeraj, Chava, Sahasra, Chaudhury, Bhaskar, and Chava, Sudheer
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Quantitative Finance - Computational Finance
Abstract: In this paper, we investigate the influence of claims in analyst reports and earnings calls on financial market returns, considering them as significant quarterly events for publicly traded companies. To facilitate a comprehensive analysis, we construct a new financial dataset for the claim detection task in the financial domain. We benchmark various language models on this dataset and propose a novel weak-supervision model that incorporates the knowledge of subject matter experts (SMEs) in the aggregation function, outperforming existing approaches. We also demonstrate the practical utility of our proposed model by constructing a novel measure of optimism. Here, we observe the dependence of earnings surprise and return on our optimism measure. Our dataset, models, and code are publicly (under CC BY 4.0 license) available on GitHub., Comment: Accepted at The Seventh FEVER Workshop EMNLP 2024
Published: 2024

19. Competitive Equilibrium for Chores: from Dual Eisenberg-Gale to a Fast, Greedy, LP-based Algorithm

Author: Chaudhury, Bhaskar Ray, Kroer, Christian, Mehta, Ruta, and Nan, Tianlong
Subjects: Computer Science - Computer Science and Game Theory, Mathematics - Optimization and Control
Abstract: We study the computation of competitive equilibrium for Fisher markets with $n$ agents and $m$ divisible chores. Prior work showed that competitive equilibria correspond to the nonzero KKT points of a non-convex analogue of the Eisenberg-Gale convex program. We introduce an analogue of the Eisenberg-Gale dual for chores: we show that all KKT points of this dual correspond to competitive equilibria, and while it is not a dual of the non-convex primal program in a formal sense, the objectives touch at all KKT points. Similar to the primal, the dual has problems from an optimization perspective: there are many feasible directions where the objective tends to positive infinity. We then derive a new constraint for the dual, which restricts optimization to a hyperplane that avoids all these directions. We show that restriction to this hyperplane retains all KKT points, and surprisingly, does not introduce any new ones. This allows, for the first time ever, application of iterative optimization methods over a convex region for computing competitive equilibria for chores. We next introduce a greedy Frank-Wolfe algorithm for optimization over our program and show a state-of-the-art convergence rate to competitive equilibrium. In the case of equal incomes, we show a $\mathcal{\tilde O}(n/\epsilon^2)$ rate of convergence, which improves over the two prior state-of-the-art rates of $\mathcal{\tilde O}(n^3/\epsilon^2)$ for an exterior-point method and $\mathcal{\tilde O}(nm/\epsilon^2)$ for a combinatorial method. Moreover, our method is significantly simpler: each iteration of our method only requires solving a simple linear program. We show through numerical experiments on simulated data and a paper review bidding dataset that our method is extremely practical. This is the first highly practical method for solving competitive equilibrium for Fisher markets with chores., Comment: 25 pages, 17 figures
Published: 2024

20. LISR: Learning Linear 3D Implicit Surface Representation Using Compactly Supported Radial Basis Functions

Author: Pandey, Atharva, Yadav, Vishal, Nagar, Rajendra, and Chaudhury, Santanu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Implicit 3D surface reconstruction of an object from its partial and noisy 3D point cloud scan is the classical geometry processing and 3D computer vision problem. In the literature, various 3D shape representations have been developed, differing in memory efficiency and shape retrieval effectiveness, such as volumetric, parametric, and implicit surfaces. Radial basis functions provide memory-efficient parameterization of the implicit surface. However, we show that training a neural network using the mean squared error between the ground-truth implicit surface and the linear basis-based implicit surfaces does not converge to the global solution. In this work, we propose locally supported compact radial basis functions for a linear representation of the implicit surface. This representation enables us to generate 3D shapes with arbitrary topologies at any resolution due to their continuous nature. We then propose a neural network architecture for learning the linear implicit shape representation of the 3D surface of an object. We learn linear implicit shapes within a supervised learning framework using ground truth Signed-Distance Field (SDF) data for guidance. The classical strategies face difficulties in finding linear implicit shapes from a given 3D point cloud due to numerical issues (requires solving inverse of a large matrix) in basis and query point selection. The proposed approach achieves better Chamfer distance and comparable F-score than the state-of-the-art approach on the benchmark dataset. We also show the effectiveness of the proposed approach by using it for the 3D shape completion task.
Published: 2024

21. Capability enhancement of the X-ray micro-tomography system via ML-assisted approaches

Author: Shah, Dhruvi, Mehta, Shruti, Agrawal, Ashish, Purohit, Shishir, and Chaudhury, Bhaskar
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Machine Learning, Physics - Applied Physics, Physics - Instrumentation and Detectors
Abstract: Ring artifacts in X-ray micro-CT images are one of the primary causes of concern in their accurate visual interpretation and quantitative analysis. The geometry of X-ray micro-CT scanners is similar to the medical CT machines, except the sample is rotated with a stationary source and detector. The ring artifacts are caused by a defect or non-linear responses in detector pixels during the MicroCT data acquisition. Artifacts in MicroCT images can often be so severe that the images are no longer useful for further analysis. Therefore, it is essential to comprehend the causes of artifacts and potential solutions to maximize image quality. This article presents a convolution neural network (CNN)-based Deep Learning (DL) model inspired by UNet with a series of encoder and decoder units with skip connections for removal of ring artifacts. The proposed architecture has been evaluated using the Structural Similarity Index Measure (SSIM) and Mean Squared Error (MSE). Additionally, the results are compared with conventional filter-based non-ML techniques and are found to be better than the latter.
Published: 2024

22. Understanding and Attaining an Investment Grade Rating in the Age of Explainable AI

Author: Makwana, Ravi, Bhatt, Dhruvil, Delwadia, Kirtan, Shah, Agam, and Chaudhury, Bhaskar
Published: 2024
Full Text: View/download PDF

23. Chronic kidney disease and the global public health agenda: an international consensus

Author: Francis, Anna, Harhay, Meera N., Ong, Albert C. M., Tummalapalli, Sri Lekha, Ortiz, Alberto, Fogo, Agnes B., Fliser, Danilo, Roy-Chaudhury, Prabir, Fontana, Monica, Nangaku, Masaomi, Wanner, Christoph, Malik, Charu, Hradsky, Anne, Adu, Dwomoa, Bavanandan, Sunita, Cusumano, Ana, Sola, Laura, Ulasi, Ifeoma, and Jha, Vivekanand
Published: 2024
Full Text: View/download PDF

24. Effect of Disturbance on Micro-environment, Soil Properties and Microbial Biomass in Subtropical Broadleaved Forests of Meghalaya, India

Author: Barbhuyan, Humayun Samir Ahmed, Upadhaya, Krishna, Chaudhury, Gunjana, and Mir, Aabid Hussain
Published: 2024
Full Text: View/download PDF

25. Averaged Deep Denoisers for Image Regularization

Author: Nair, Pravin and Chaudhury, Kunal N.
Published: 2024
Full Text: View/download PDF

26. Sums of squares of integral multiples of an integral element of real bi-quadratic fields

Author: Chaudhury, Srijonee Shabnam
Published: 2024
Full Text: View/download PDF

27. On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $\epsilon$-Greedy Exploration

Author: Zhang, Shuai, Li, Hongkang, Wang, Meng, Liu, Miao, Chen, Pin-Yu, Lu, Songtao, Liu, Sijia, Murugesan, Keerthiram, and Chaudhury, Subhajit
Subjects: Computer Science - Machine Learning
Abstract: This paper provides a theoretical understanding of Deep Q-Network (DQN) with the $\varepsilon$-greedy exploration in deep reinforcement learning. Despite the tremendous empirical achievement of the DQN, its theoretical characterization remains underexplored. First, the exploration strategy is either impractical or ignored in the existing analysis. Second, in contrast to conventional Q-learning algorithms, the DQN employs the target network and experience replay to acquire an unbiased estimation of the mean-square Bellman error (MSBE) utilized in training the Q-network. However, the existing theoretical analysis of DQNs lacks convergence analysis or bypasses the technical challenges by deploying a significantly overparameterized neural network, which is not computationally efficient. This paper provides the first theoretical convergence and sample complexity analysis of the practical setting of DQNs with $\epsilon$-greedy policy. We prove an iterative procedure with decaying $\epsilon$ converges to the optimal Q-value function geometrically. Moreover, a higher level of $\epsilon$ values enlarges the region of convergence but slows down the convergence, while the opposite holds for a lower level of $\epsilon$ values. Experiments justify our established theoretical insights on DQNs.
Published: 2023

28. On the Contractivity of Plug-and-Play Operators

Author: Athalye, Chirayu D., Chaudhury, Kunal N., and Kumar, Bhartendu
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: In plug-and-play (PnP) regularization, the proximal operator in algorithms such as ISTA and ADMM is replaced by a powerful denoiser. This formal substitution works surprisingly well in practice. In fact, PnP has been shown to give state-of-the-art results for various imaging applications. The empirical success of PnP has motivated researchers to understand its theoretical underpinnings and, in particular, its convergence. It was shown in prior work that for kernel denoisers such as the nonlocal means, PnP-ISTA provably converges under some strong assumptions on the forward model. The present work is motivated by the following questions: Can we relax the assumptions on the forward model? Can the convergence analysis be extended to PnP-ADMM? Can we estimate the convergence rate? In this letter, we resolve these questions using the contraction mapping theorem: (i) for symmetric denoisers, we show that (under mild conditions) PnP-ISTA and PnP-ADMM exhibit linear convergence; and (ii) for kernel denoisers, we show that PnP-ISTA and PnP-ADMM converge linearly for image inpainting. We validate our theoretical findings using reconstruction experiments., Comment: Errors in the proof of Lemma 1 and the statement of Theorem 2 were identified after the publication; these have been rectified in the revised version (v2)
Published: 2023
Full Text: View/download PDF

29. Nivolumab Plus 5-Azacitidine in Pediatric Relapsed/Refractory Acute Myeloid Leukemia (AML): Phase I/II Trial Results from the Therapeutic Advances in Childhood Leukemia and Lymphoma (TACL) Consortium.

Author: Verma, Anupam, Chi, Yueh-Yun, Malvar, Jemily, Lamble, Adam, Chaudhury, Sonali, Agarwal, Archana, Li, Hong-Tao, Liang, Gangning, Leong, Roy, Brown, Patrick, Kaplan, Joel, Schafer, Eric, Slone, Tamra, Pauly, Melinda, Chang, Bill, Wayne, Alan, Hijiya, Nobuko, Bhojwani, Deepa, and Stieglitz, Elliot
Subjects: DNA methylation, acute myeloid leukemia, checkpoint inhibitor, dose-limiting toxicity, immunotherapy
Abstract: Improvements in survival have been made over the past two decades for childhood acute myeloid leukemia (AML), but the approximately 40% of patients who relapse continue to have poor outcomes. A combination of checkpoint-inhibitor nivolumab and azacitidine has demonstrated improvements in median survival in adults with AML. This phase I/II study with nivolumab and azacitidine in children with relapsed/refractory AML (NCT03825367) was conducted through the Therapeutic Advances in Childhood Leukemia & Lymphoma consortium. Thirteen patients, median age 13.7 years, were enrolled. Patients had refractory disease with multiple reinduction attempts. Twelve evaluable patients were treated at the recommended phase II dose (established at dose level 1, 3 mg/kg/dose). Four patients (33%) maintained stable disease. This combination was well tolerated, with no dose-limiting toxicities observed. Grade 3-4 adverse events (AEs) were primarily hematological. Febrile neutropenia was the most common AE ≥ grade 3. A trend to improved quality of life was noted. Increases in CD8+ T cells and reductions in CD4+/CD8+ T cells and demethylation were observed. The combination was well tolerated and had an acceptable safety profile in pediatric patients with relapsed/refractory AML. Future studies might explore this combination for the maintenance of remission in children with AML at high risk of relapse.
Published: 2024

30. Efficient quantum multi-authority attribute-based encryption and generalizations

Author: Chaudhury, Shion Samadder
Published: 2024
Full Text: View/download PDF

31. The crosstalk between microbial sensors ELMO1 and NOD2 shape intestinal immune responses

Author: Sharma, Aditi, Achi, Sajan Chandrangadhan, Ibeawuchi, Stella-Rita, Anandachar, Mahitha Shree, Gementera, Hobie, Chaudhury, Uddeep, Usmani, Fatima, Vega, Kevin, Sayed, Ibrahim M, and Das, Soumita
Subjects: Microbiology, Biochemistry and Cell Biology, Biomedical and Clinical Sciences, Biological Sciences, Autoimmune Disease, Digestive Diseases, Inflammatory Bowel Disease, Crohn's Disease, 1.1 Normal biological development and functioning, 2.1 Biological and endogenous factors, Aetiology, Underpinning research, Inflammatory and immune system, Infection, Oral and gastrointestinal, Animals, Mice, Adaptor Proteins, Signal Transducing, Crohn Disease, Escherichia coli, Immunity, Intestines, Macrophages, Microbial sensors, NOD2, bacterial engulfment, AIEC-LF82, epithelial cells and macrophages, 3D-organoid, ELMO-1, AIEC-LF82, epithelial cells and macrophages, Ecological Applications, Medical Microbiology, Medical microbiology
Abstract: Microbial sensors play an essential role in maintaining cellular homoeostasis. Our knowledge is limited on how microbial sensing helps in differential immune response and its link to inflammatory diseases. Recently we have confirmed that ELMO1 (Engulfment and Cell Motility Protein-1) present in cytosol is involved in pathogen sensing, engulfment, and intestinal inflammation. Here, we show that ELMO1 interacts with another sensor, NOD2 (Nucleotide-binding oligomerization domain-containing protein 2), that recognizes bacterial cell wall component muramyl dipeptide (MDP). The polymorphism of NOD2 is linked to Crohn's disease (CD) pathogenesis. Interestingly, we found that overexpression of ELMO1 and mutant NOD2 (L1007fs) were not able to clear the CD-associated adherent invasive E. coli (AIEC-LF82). The functional implications of ELMO1-NOD2 interaction in epithelial cells were evaluated by using enteroid-derived monolayers (EDMs) from ELMO1 and NOD2 KO mice. Subsequently we also assessed the immune response in J774 macrophages depleted of either ELMO1 or NOD2 or both. The infection of murine EDMs with AIEC-LF82 showed higher bacterial load in ELMO1-KO, NOD2 KO EDMs, and ELMO1 KO EDMs treated with NOD2 inhibitors. The murine macrophage cells showed that the downregulation of ELMO1 and NOD2 is associated with impaired bacterial clearance that is linked to reduce pro-inflammatory cytokines and reactive oxygen species. Our results indicated that the crosstalk between microbial sensors in enteric infection and inflammatory diseases impacts the fate of the bacterial load and disease pathogenesis.
Published: 2023

32. Nighttime-specific differential gene expression in suprachiasmatic nucleus and habenula is associated with resilience to chronic social stress

Author: Priyam Narain, Aleksa Petković, Marko Šušić, Salma Haniffa, Mariam Anwar, Marc Arnoux, Nizar Drou, Giuseppe Antonio-Saldi, and Dipesh Chaudhury
Subjects: Neurosciences. Biological psychiatry. Neuropsychiatry, RC321-571
Abstract: Abstract The molecular mechanisms that link stress and biological rhythms still remain unclear. The habenula (Hb) is a key brain region involved in regulating diverse types of emotion-related behaviours while the suprachiasmatic nucleus (SCN) is the body’s central clock. To investigate the effects of chronic social stress on transcription patterns, we performed gene expression analysis in the Hb and SCN of stress-naïve and stress-exposed mice. Our analysis revealed a large number of differentially expressed genes and enrichment of synaptic and cell signalling pathways between resilient and stress-naïve mice at zeitgeber 16 (ZT16) in both the Hb and SCN. This transcriptomic signature was nighttime-specific and observed only in stress-resilient mice. In contrast, there were relatively few differences between the stress-susceptible and stress-naïve groups across time points. Our results reinforce the functional link between circadian gene expression patterns and differential responses to stress, thereby highlighting the importance of temporal expression patterns in homoeostatic stress responses.
Published: 2024
Full Text: View/download PDF

33. Interplay of degeneracy and non-degeneracy in fluctuations propagation in coherent feed-forward loop motif

Author: Roy, Tuhin Subhra, Nandi, Mintu, Chaudhury, Pinaki, Chattopadhyay, Sudip, and Banik, Suman K
Subjects: Quantitative Biology - Molecular Networks, Physics - Biological Physics
Abstract: We present a stochastic framework to decipher fluctuations propagation in classes of coherent feed-forward loops. The systematic contribution of the direct (one-step) and indirect (two-step) pathways is considered to quantify fluctuations of the output node. We also consider both additive and multiplicative integration mechanisms of the two parallel pathways (one-step and two-step). Analytical expression of the output node's coefficient of variation shows contributions of intrinsic, one-step, two-step, and cross-interaction in closed form. We observe a diverse range of degeneracy and non-degeneracy in each of the decomposed fluctuations term and their contribution to the overall output fluctuations of each coherent feed-forward loop motif. Analysis of output fluctuations reveals a maximal level of fluctuations of the coherent feed-forward loop motif of type 1., Comment: 20 pages, 4 figures
Published: 2023
Full Text: View/download PDF

34. LakeBench: Benchmarks for Data Discovery over Data Lakes

Author: Srinivas, Kavitha, Dolby, Julian, Abdelaziz, Ibrahim, Hassanzadeh, Oktie, Kokel, Harsha, Khatiwada, Aamod, Pedapati, Tejaswini, Chaudhury, Subhajit, and Samulowitz, Horst
Subjects: Computer Science - Databases, Computer Science - Artificial Intelligence
Abstract: Within enterprises, there is a growing need to intelligently navigate data lakes, specifically focusing on data discovery. Of particular importance to enterprises is the ability to find related tables in data repositories. These tables can be unionable, joinable, or subsets of each other. There is a dearth of benchmarks for these tasks in the public domain, with related work targeting private datasets. In LakeBench, we develop multiple benchmarks for these tasks by using the tables that are drawn from a diverse set of data sources such as government data from CKAN, Socrata, and the European Central Bank. We compare the performance of 4 publicly available tabular foundational models on these tasks. None of the existing models had been trained on the data discovery tasks that we developed for this benchmark; not surprisingly, their performance shows significant room for improvement. The results suggest that the establishment of such benchmarks may be useful to the community to build tabular models usable for data discovery in data lakes.
Published: 2023

35. Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning

Author: Chaudhury, Subhajit, Swaminathan, Sarathkrishna, Kimura, Daiki, Sen, Prithviraj, Murugesan, Keerthiram, Uceda-Sosa, Rosario, Tatsubori, Michiaki, Fokoue, Achille, Kapanipathi, Pavan, Munawar, Asim, and Gray, Alexander
Subjects: Computer Science - Computation and Language
Abstract: Text-based reinforcement learning agents have predominantly been neural network-based models with embeddings-based representation, learning uninterpretable policies that often do not generalize well to unseen games. On the other hand, neuro-symbolic methods, specifically those that leverage an intermediate formal representation, are gaining significant attention in language understanding tasks. This is because of their advantages ranging from inherent interpretability, the lesser requirement of training data, and being generalizable in scenarios with unseen data. Therefore, in this paper, we propose a modular, NEuro-Symbolic Textual Agent (NESTA) that combines a generic semantic parser with a rule induction system to learn abstract interpretable rules as policies. Our experiments on established text-based game benchmarks show that the proposed NESTA method outperforms deep reinforcement learning-based techniques by achieving better generalization to unseen test games and learning from fewer training interactions., Comment: ACL 2023
Published: 2023

36. MISMATCH: Fine-grained Evaluation of Machine-generated Text with Mismatch Error Types

Author: Murugesan, Keerthiram, Swaminathan, Sarathkrishna, Dan, Soham, Chaudhury, Subhajit, Gunasekara, Chulaka, Crouse, Maxwell, Mahajan, Diwakar, Abdelaziz, Ibrahim, Fokoue, Achille, Kapanipathi, Pavan, Roukos, Salim, and Gray, Alexander
Subjects: Computer Science - Computation and Language
Abstract: With the growing interest in large language models, the need for evaluating the quality of machine text compared to reference (typically human-generated) text has become focal attention. Most recent works focus either on task-specific evaluation metrics or study the properties of machine-generated text captured by the existing metrics. In this work, we propose a new evaluation scheme to model human judgments in 7 NLP tasks, based on the fine-grained mismatches between a pair of texts. Inspired by the recent efforts in several NLP tasks for fine-grained evaluation, we introduce a set of 13 mismatch error types such as spatial/geographic errors, entity errors, etc, to guide the model for better prediction of human judgments. We propose a neural framework for evaluating machine texts that uses these mismatch error types as auxiliary tasks and re-purposes the existing single-number evaluation metrics as additional scalar features, in addition to textual features extracted from the machine and reference texts. Our experiments reveal key insights about the existing metrics via the mismatch errors. We show that the mismatch errors between the sentence pairs on the held-out datasets from 7 NLP tasks align well with the human evaluation., Comment: Accepted at ACL 2023 (ACL Findings Long)
Published: 2023

37. Demographic and socioeconomic determinants of missing labs and imaging for otolaryngologic clinical visits

Author: Jad Zeitouni, Jyntre Millsap, Wooyoung Jang, Cynthia Schwartz, Hannah Chaudhury, Tristin Chaudhury, and Yusuf Dundar
Subjects: determinates, follow‐up, imaging, missing labs, Otorhinolaryngology, RF1-547, Surgery, RD1-811
Abstract: Abstract Objectives/Hypothesis Socioeconomics and demographics have been shown to be determinates of healthcare in specialty clinics, in which thorough research is lacking in the setting of the United States clinical sphere. We set out to determine the impact of socioeconomic and demographic factors on patient preparedness in an otolaryngologic clinic as to highlight the need for awareness in this aspect of disparate and delayed clinical care. Study Design Retrospective chart review. Methods A chart review was conducted of 482 patients who visited our otolaryngology clinic between June 1, 2020 and June 1, 2023. Demographic data including marital status, gender, age, zip code, and race was collected. Results Our study found several interesting points of significance. Marital status was a significant determinant of whether patients had missing labs and/or imaging (p = .001). Age was a significant determinant of patients having their imaging (p
Published: 2024
Full Text: View/download PDF

38. Nighttime-specific differential gene expression in suprachiasmatic nucleus and habenula is associated with resilience to chronic social stress

Author: Narain, Priyam, Petković, Aleksa, Šušić, Marko, Haniffa, Salma, Anwar, Mariam, Arnoux, Marc, Drou, Nizar, Antonio-Saldi, Giuseppe, and Chaudhury, Dipesh
Published: 2024
Full Text: View/download PDF

39. Midbrain glutamatergic circuit mechanism of resilience to socially transferred allodynia in male mice

Author: Han, Yi, Ai, Lin, Song, Lingzhen, Zhou, Yu, Chen, Dandan, Sha, Sha, Ji, Ran, Li, Qize, Bu, Qingyang, Pan, Xiangyu, Zhai, Xiaojing, Cui, Mengqiao, Duan, Jiawen, Yang, Junxia, Chaudhury, Dipesh, Hu, Ankang, Liu, He, Han, Ming-Hu, Cao, Jun-Li, and Zhang, Hongxing
Published: 2024
Full Text: View/download PDF

40. Standardization of 109Cd Using CIEMAT/NIST Method and Internal Conversion Electron Counting

Author: Kulkarni, D. B., Anuradha, R., Sharma, Ritu, Reddy, P. J., Sathian, V., and Chaudhury, Probal
Published: 2024
Full Text: View/download PDF

41. International Equivalence of 60Co: Sir Measurements of BIPM Comparison

Author: Ravindra, Anuradha, Kulkarni, D. B., Sharma, Ritu, Sathian, V., and Chaudhury, Probal
Published: 2024
Full Text: View/download PDF

42. Estimation of Reference Gamma Radiation Field for Calibration of Radiation Monitors

Author: Singh, Sunil K., Shaiju, Liji, Gupta, Aashna, Tripathi, S. M., Sathian, V., and Chaudhury, Probal
Published: 2024
Full Text: View/download PDF

43. National Audit for Traceable 131I Activity Measurements with Radionuclide Calibrators Among Nuclear Medicine Centers in India

Author: Ravindra, Anuradha, Kulkarni, D. B., Sharma, Ritu, Sathian, V., Chaudhury, Probal, and Aswal, D. K.
Published: 2024
Full Text: View/download PDF

44. Scalable Learning of Latent Language Structure With Logical Offline Cycle Consistency

Author: Crouse, Maxwell, Astudillo, Ramon, Naseem, Tahira, Chaudhury, Subhajit, Kapanipathi, Pavan, Roukos, Salim, and Gray, Alexander
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We introduce Logical Offline Cycle Consistency Optimization (LOCCO), a scalable, semi-supervised method for training a neural semantic parser. Conceptually, LOCCO can be viewed as a form of self-learning where the semantic parser being trained is used to generate annotations for unlabeled text that are then used as new supervision. To increase the quality of annotations, our method utilizes a count-based prior over valid formal meaning representations and a cycle-consistency score produced by a neural text generation model as additional signals. Both the prior and semantic parser are updated in an alternate fashion from full passes over the training data, which can be seen as approximating the marginalization of latent structures through stochastic variational inference. The use of a count-based prior, frozen text generation model, and offline annotation process yields an approach with negligible complexity and latency increases as compared to conventional self-learning. As an added bonus, the annotations produced by LOCCO can be trivially repurposed to train a neural text generation model. We demonstrate the utility of LOCCO on the well-known WebNLG benchmark where we obtain an improvement of 2 points against a self-learning parser under equivalent conditions, an improvement of 1.3 points against the previous state-of-the-art parser, and competitive text generation performance in terms of BLEU score.
Published: 2023

45. Fair and Efficient Allocation of Indivisible Chores with Surplus

Author: Akrami, Hannaneh, Chaudhury, Bhaskar Ray, Garg, Jugal, Mehlhorn, Kurt, and Mehta, Ruta
Subjects: Computer Science - Computer Science and Game Theory
Abstract: We study fair division of indivisible chores among $n$ agents with additive disutility functions. Two well-studied fairness notions for indivisible items are envy-freeness up to one/any item (EF1/EFX) and the standard notion of economic efficiency is Pareto optimality (PO). There is a noticeable gap between the results known for both EF1 and EFX in the goods and chores settings. The case of chores turns out to be much more challenging. We reduce this gap by providing slightly relaxed versions of the known results on goods for the chores setting. Interestingly, our algorithms run in polynomial time, unlike their analogous versions in the goods setting. We introduce the concept of $k$ surplus which means that up to $k$ more chores are allocated to the agents and each of them is a copy of an original chore. We present a polynomial-time algorithm which gives EF1 and PO allocations with $(n-1)$ surplus. We relax the notion of EFX slightly and define tEFX which requires that the envy from agent $i$ to agent $j$ is removed upon the transfer of any chore from the $i$'s bundle to $j$'s bundle. We give a polynomial-time algorithm that in the chores case for $3$ agents returns an allocation which is either proportional or tEFX. Note that proportionality is a very strong criterion in the case of indivisible items, and hence both notions we guarantee are desirable.
Published: 2023

46. Laziness Is a Virtue When It Comes to Compositionality in Neural Semantic Parsing

Author: Crouse, Maxwell, Kapanipathi, Pavan, Chaudhury, Subhajit, Naseem, Tahira, Astudillo, Ramon, Fokoue, Achille, and Klinger, Tim
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Nearly all general-purpose neural semantic parsers generate logical forms in a strictly top-down autoregressive fashion. Though such systems have achieved impressive results across a variety of datasets and domains, recent works have called into question whether they are ultimately limited in their ability to compositionally generalize. In this work, we approach semantic parsing from, quite literally, the opposite direction; that is, we introduce a neural semantic parsing generation method that constructs logical forms from the bottom up, beginning from the logical form's leaves. The system we introduce is lazy in that it incrementally builds up a set of potential semantic parses, but only expands and processes the most promising candidate parses at each generation step. Such a parsimonious expansion scheme allows the system to maintain an arbitrarily large set of parse hypotheses that are never realized and thus incur minimal computational overhead. We evaluate our approach on compositional generalization; specifically, on the challenging CFQ dataset and three Text-to-SQL datasets where we show that our novel, bottom-up semantic parsing technique outperforms general-purpose semantic parsers while also being competitive with comparable neural parsers that have been designed for each task., Comment: Accepted to ACL main conference
Published: 2023

47. Deep Learning assisted microwave-plasma interaction based technique for plasma density estimation

Author: Ghosh, Pratik, Chaudhury, Bhaskar, Purohit, Shishir, Joshi, Vishv, Kothari, Ashray, and Shetranjiwala, Devdeep
Subjects: Physics - Plasma Physics, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Physics - Computational Physics
Abstract: The electron density is a key parameter to characterize any plasma. Most of the plasma applications and research in the area of low-temperature plasmas (LTPs) are based on the accurate estimations of plasma density and plasma temperature. The conventional methods for electron density measurements offer axial and radial profiles for any given linear LTP device. These methods have major disadvantages of operational range (not very wide), cumbersome instrumentation, and complicated data analysis procedures. The article proposes a Deep Learning (DL) assisted microwave-plasma interaction-based non-invasive strategy, which can be used as a new alternative approach to address some of the challenges associated with existing plasma density measurement techniques. The electric field pattern due to microwave scattering from plasma is utilized to estimate the density profile. The proof of concept is tested for a simulated training data set comprising a low-temperature, unmagnetized, collisional plasma. Different types of symmetric (Gaussian-shaped) and asymmetrical density profiles, in the range $10^{16}-10^{19}$ m$^{-3}$, addressing a range of experimental configurations have been considered in our study. Real-life experimental issues such as the presence of noise and the amount of measured data (dense vs sparse) have been taken into consideration while preparing the synthetic training data-sets. The DL-based technique has the capability to determine the electron density profile within the plasma. The performance of the proposed deep learning-based approach has been evaluated using three metrics- SSIM, RMSLE, and MAPE. The obtained results show promising performance in estimating the 2D radial profile of the density for the given linear plasma device and affirms the potential of the proposed ML-based approach in plasma diagnostics.
Published: 2023

48. Shape from Shading for Robotic Manipulation

Author: Chaudhury, Arkadeep Narayan, Keselman, Leonid, and Atkeson, Christopher G.
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: Controlling illumination can generate high quality information about object surface normals and depth discontinuities at a low computational cost. In this work we demonstrate a robot workspace-scaled controlled illumination approach that generates high quality information for table top scale objects for robotic manipulation. With our low angle of incidence directional illumination approach, we can precisely capture surface normals and depth discontinuities of monochromatic Lambertian objects. We show that this approach to shape estimation is 1) valuable for general purpose grasping with a single point vacuum gripper, 2) can measure the deformation of known objects, and 3) can estimate pose of known objects and track unknown objects in the robot's workspace., Comment: Project webpage: https://arkadeepnc.github.io/projects/active_workspace/index.html
Published: 2023

49. Spontaneous Emulsification: Elucidation of the Local Processes

Author: Kullappan, Monicka, Patel, Wes, and Chaudhury, Manoj K.
Subjects: Condensed Matter - Soft Condensed Matter, Physics - Chemical Physics
Abstract: Micro and/or Nano sized emulsions are formed when an organic liquid gently comes in contact with water in the presence of a surfactant, where no external agitation is required. Many years of research made it clear that the driving force for spontaneous emulsification arises from the differences of the chemical potentials of various components in the organic and aqueous phases, which triggers diffusion coupled hydrodynamic fluctuation. While extraordinary theoretical developments have taken place that attempted to describe these processes within the scopes of equilibrium and non-equilibrium thermodynamics, the local processes underlying the spontaneous emulsification, however, still remain elusive. In this research, we investigate the local processes that involve the transfer of surfactant as well as water from one phase to another (i.e. water to oil), which results in the formation of water-in-oil emulsion in the organic phase and, subsequently. Thes emulsions invert into oil-in-water emulsion, rather abruptly, as they cross the phase boundary. Studies based on UV spectroscopy and molecular dynamics indicate that these processes may involve explosive events and subsequent assembly of the fragments to other organized structures which are reminiscent of cusp catastrophe proposed earlier by Dickinson. These processes lead to either to a strong or a weak fluctuation of the component concentrations below the interface that also becomes evident in the fast (athermal) diffusion of the emulsion droplets from the interfacial region farther into the bulk water. These events can be arrested suitably with polymeric additives.
Published: 2023

50. Adhesive Release via Elasto-Osmotic Stress Driven Surface Instability

Author: Al-sakkaf, Khulood, Kullappan, Monicka, and Chaudhury, Manoj K.
Subjects: Condensed Matter - Soft Condensed Matter, Physics - Applied Physics
Abstract: Recent studies demonstrated that an elastomer containing hygroscopic inclusions absorbs moisture and swell. Here we show that a thin film of such an elastomer bonded to a rigid substrate undergoes morphological instability upon absorption of water, the wavelength of which increases linearly with its thickness. As the driving force for such a morphological instability arises from the difference of the chemical potential of water between its source and that in the film, its development is slowed down as the salinity of the water increases. Nonetheless, the wavelength of the fully developed morphology, but not its amplitude, is independent of the salinity. We also demonstrate that if a domed disk shaped adherent is attached to the hygro-elastomeric film before moisture absorption, the elastic force generated during the morphological transition is able to dislodge it completely without the need of any external force. These patterns, once developed in pure water, is subdued when the salinity of water increases or if it is exposed to dry air. They re-emerge when the film is immersed in water again. Such an active response could be important in fouling release when a ship coated with such a hygro-elastomer changes its location during its long travel through sea, where salinity varies from place to place.
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

3,611 results on '"Chaudhury, P."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources