21,451 results on '"P. P. Patil"'
Search Results
2. Characterizing the Molecular Gas in Infrared Bright Galaxies with CARMA
- Author
-
Alatalo, Katherine, Petric, Andreea O., Lanz, Lauranne, Rowlands, Kate, U, Vivian, Larson, Kirsten L., Armus, Lee, Barcos-Muñoz, Loreto, Evans, Aaron S., Koda, Jin, Luo, Yuanze, Medling, Anne M., Nyland, Kristina E., Otter, Justin A., Patil, Pallavi, Peñaloza, Fernando, Salim, Diane, Sanders, David B., Sazonova, Elizaveta, Skarbinski, Maya, Song, Yiqing, Treister, Ezequiel, and Urry, C. Meg
- Subjects
Astrophysics - Astrophysics of Galaxies - Abstract
We present the CO(1-0) maps of 28 infrared-bright galaxies from the Great Observatories All-Sky Luminous Infrared Galaxy Survey (GOALS) taken with the Combined Array for Research in Millimeter Astronomy (CARMA). We detect 100GHz continuum in 16 of 28 galaxies, which trace both active galactic nuclei (AGNs) and compact star-forming cores. The GOALS galaxies show a variety of molecular gas morphologies, though in the majority of cases, the average velocity fields show a gradient consistent with rotation. We fit the full continuum SEDs of each of the source using either MAGPHYS or SED3FIT (if there are signs of an AGN) to derive the total stellar mass, dust mass, and star formation rates of each object. We adopt a value determined from luminous and ultraluminous infrared galaxies (LIRGs and ULIRGs) of $\alpha_{\rm CO}=1.5^{+1.3}_{-0.8}~M_\odot$ (K km s$^{-1}$ pc$^2)^{-1}$, which leads to more physical values for $f_{\rm mol}$ and the gas-to-dust ratio. Mergers tend to have the highest gas-to-dust ratios. We assume the cospatiality of the molecular gas and star formation, and plot the sample on the Schmidt-Kennicutt relation, we find that they preferentially lie above the line set by normal star-forming galaxies. This hyper-efficiency is likely due to the increased turbulence in these systems, which decreases the freefall time compared to star-forming galaxies, leading to "enhanced" star formation efficiency. Line wings are present in a non-negligible subsample (11/28) of the CARMA GOALS sources and are likely due to outflows driven by AGNs or star formation, gas inflows, or additional decoupled gas components., Comment: 29 pages, 4 tables, 11 figures, Accepted by the Astrophysical Journal
- Published
- 2024
3. Rapid Assessment of Stable Crystal Structures in Single Phase High Entropy Alloys Via Graph Neural Network Based Surrogate Modelling
- Author
-
Beaver, Nicholas, Dive, Aniruddha, Wong, Marina, Shimanuki, Keita, Patil, Ananya, Ferrell, Anthony, and Kivy, Mohsen B.
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Disordered Systems and Neural Networks - Abstract
In an effort to develop a rapid, reliable, and cost-effective method for predicting the structure of single-phase high entropy alloys, a Graph Neural Network (ALIGNN-FF) based approach was introduced. This method was successfully tested on 132 different high entropy alloys, and the results were analyzed and compared with density functional theory and valence electron concentration calculations. Additionally, the effects of various factors, including lattice parameters and the number of supercells with unique atomic configurations, on the prediction accuracy were investigated. The ALIGNN-FF based approach was subsequently used to predict the structure of a novel cobalt-free 3d high entropy alloy, and the result was experimentally verified., Comment: 16 pages, 8 Figures. To be published in Results Materials
- Published
- 2024
4. Implicit Regularization Paths of Weighted Neural Representations
- Author
-
Du, Jin-Hong and Patil, Pratik
- Subjects
Computer Science - Machine Learning ,Mathematics - Statistics Theory ,Statistics - Machine Learning - Abstract
We study the implicit regularization effects induced by (observation) weighting of pretrained features. For weight and feature matrices of bounded operator norms that are infinitesimally free with respect to (normalized) trace functionals, we derive equivalence paths connecting different weighting matrices and ridge regularization levels. Specifically, we show that ridge estimators trained on weighted features along the same path are asymptotically equivalent when evaluated against test vectors of bounded norms. These paths can be interpreted as matching the effective degrees of freedom of ridge estimators fitted with weighted features. For the special case of subsampling without replacement, our results apply to independently sampled random features and kernel features and confirm recent conjectures (Conjectures 7 and 8) of the authors on the existence of such paths in Patil et al. We also present an additive risk decomposition for ensembles of weighted estimators and show that the risks are equivalent along the paths when the ensemble size goes to infinity. As a practical consequence of the path equivalences, we develop an efficient cross-validation method for tuning and apply it to subsampled pretrained representations across several models (e.g., ResNet-50) and datasets (e.g., CIFAR-100)., Comment: 19 pages for main and 19 pages for appendix
- Published
- 2024
5. HER2 and FISH Status Prediction in Breast Biopsy H&E-Stained Images Using Deep Learning
- Author
-
Sekhar, Ardhendu, Goel, Vrinda, Jain, Garima, Patil, Abhijeet, Gupta, Ravi Kant, and Sethi, Amit
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
The current standard for detecting human epidermal growth factor receptor 2 (HER2) status in breast cancer patients relies on HER2 amplification, identified through fluorescence in situ hybridization (FISH) or immunohistochemistry (IHC). However, hematoxylin and eosin (H\&E) tumor stains are more widely available, and accurately predicting HER2 status using H\&E could reduce costs and expedite treatment selection. Deep Learning algorithms for H&E have shown effectiveness in predicting various cancer features and clinical outcomes, including moderate success in HER2 status prediction. In this work, we employed a customized weak supervision classification technique combined with MoCo-v2 contrastive learning to predict HER2 status. We trained our pipeline on 182 publicly available H&E Whole Slide Images (WSIs) from The Cancer Genome Atlas (TCGA), for which annotations by the pathology team at Yale School of Medicine are publicly available. Our pipeline achieved an Area Under the Curve (AUC) of 0.85 across four different test folds. Additionally, we tested our model on 44 H&E slides from the TCGA-BRCA dataset, which had an HER2 score of 2+ and included corresponding HER2 status and FISH test results. These cases are considered equivocal for IHC, requiring an expensive FISH test on their IHC slides for disambiguation. Our pipeline demonstrated an AUC of 0.81 on these challenging H&E slides. Reducing the need for FISH test can have significant implications in cancer treatment equity for underserved populations.
- Published
- 2024
6. Predicting Solar Energy Generation with Machine Learning based on AQI and Weather Features
- Author
-
Shah, Arjun, Viswanath, Varun, Gandhi, Kashish, and Patil, Nilesh Madhukar
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
This paper addresses the pressing need for an accurate solar energy prediction model, which is crucial for efficient grid integration. We explore the influence of the Air Quality Index and weather features on solar energy generation, employing advanced Machine Learning and Deep Learning techniques. Our methodology uses time series modeling and makes novel use of power transform normalization and zero-inflated modeling. Various Machine Learning algorithms and Conv2D Long Short-Term Memory model based Deep Learning models are applied to these transformations for precise predictions. Results underscore the effectiveness of our approach, demonstrating enhanced prediction accuracy with Air Quality Index and weather features. We achieved a 0.9691 $R^2$ Score, 0.18 MAE, 0.10 RMSE with Conv2D Long Short-Term Memory model, showcasing the power transform technique's innovation in enhancing time series forecasting for solar energy generation. Such results help our research contribute valuable insights to the synergy between Air Quality Index, weather features, and Deep Learning techniques for solar energy prediction., Comment: 10 pages, 11 figures
- Published
- 2024
- Full Text
- View/download PDF
7. Towards a theory of phase transitions in quantum control landscapes
- Author
-
Beato, Nicolò, Patil, Pranay, and Bukov, Marin
- Subjects
Quantum Physics ,Condensed Matter - Other Condensed Matter ,Condensed Matter - Quantum Gases ,Condensed Matter - Statistical Mechanics ,Condensed Matter - Strongly Correlated Electrons - Abstract
Control landscape phase transitions (CLPTs) occur as abrupt changes in the cost function landscape upon varying a control parameter, and can be revealed by non-analytic points in statistical order parameters. A prime example are quantum speed limits (QSL) which mark the onset of controllability as the protocol duration is increased. Here we lay the foundations of an analytical theory for CLPTs by developing Dyson, Magnus, and cumulant expansions for the cost function that capture the behavior of CLPTs with a controlled precision. Using linear and quadratic stability analysis, we reveal that CLPTs can be associated with different types of instabilities of the optimal protocol. This allows us to explicitly relate CLPTs to critical structural rearrangements in the extrema of the control landscape: utilizing path integral methods from statistical field theory, we trace back the critical scaling of the order parameter at the QSL to the topological and geometric properties of the set of optimal protocols, such as the number of connected components and its dimensionality. We verify our predictions by introducing a numerical sampling algorithm designed to explore this optimal set via a homotopic stochastic update rule. We apply this new toolbox explicitly to analyze CLPTs in the single- and two-qubit control problems whose landscapes are analytically tractable, and compare the landscapes for bang-bang and continuous protocols. Our work provides the first steps towards a systematic theory of CLPTs and paves the way for utilizing statistical field theory techniques for generic complex control landscapes., Comment: 38 pages, 24 figures
- Published
- 2024
8. Undominated monopoly regulation
- Author
-
Mishra, Debasis and Patil, Sanket
- Subjects
Economics - Theoretical Economics - Abstract
We study undominated mechanisms with transfers for regulating a monopolist who privately observes the marginal cost of production. We show that in any undominated mechanism, there is a quantity floor, which depends only on the primitives, and the regulator's operation decision is stochastic only if the monopolist produces at the quantity floor. We provide a near-complete characterization of the set of undominated mechanisms and use it to (a) provide a foundation for deterministic mechanisms, (b) show that the efficient mechanism is dominated, and (c) derive a max-min optimal regulatory mechanism.
- Published
- 2024
9. Hybrid Semantic Search: Unveiling User Intent Beyond Keywords
- Author
-
Ahluwalia, Aman, Sutradhar, Bishwajit, Ghosh, Karishma, Yadav, Indrapal, Sheetal, Arpan, and Patil, Prashant
- Subjects
Computer Science - Information Retrieval ,Computer Science - Artificial Intelligence - Abstract
This paper addresses the limitations of traditional keyword-based search in understanding user intent and introduces a novel hybrid search approach that leverages the strengths of non-semantic search engines, Large Language Models (LLMs), and embedding models. The proposed system integrates keyword matching, semantic vector embeddings, and LLM-generated structured queries to deliver highly relevant and contextually appropriate search results. By combining these complementary methods, the hybrid approach effectively captures both explicit and implicit user intent.The paper further explores techniques to optimize query execution for faster response times and demonstrates the effectiveness of this hybrid search model in producing comprehensive and accurate search outcomes.
- Published
- 2024
10. Nonlinear Quantum Optics at a Topological Interface Enabled by Defect Engineering
- Author
-
Hallacy, L., Martin, N. J., Mehrabad, M. Jalali, Hallett, D., Chen, X., Dost, R., Foster, A., Brunswick, L., Fenzl, A., Clarke, E., Patil, P. K., Fox, A. M, Skolnick, M. S., and Wilson, L. R.
- Subjects
Physics - Optics ,Quantum Physics - Abstract
The integration of topology into photonics has generated a new design framework for constructing robust and unidirectional waveguides, which are not feasible with traditional photonic devices. Here, we overcome current barriers to the successful integration of quantum emitters such as quantum dots (QDs) into valley-Hall (VH) topological waveguides, utilising photonic defects at the topological interface to stabilise the local charge environment and inverse design for efficient topological-conventional mode conversion. By incorporating QDs within defects of VH-photonic crystals, we demonstrate the first instances of single-photon resonant fluorescence and resonant transmission spectroscopy of a quantum emitter at a topological waveguide interface. Our results bring together topological photonics with optical nonlinear effects at the single-photon level, offering a new avenue to investigate the interaction between topology and quantum nonlinear systems.
- Published
- 2024
11. Stabilizer Entanglement Distillation and Efficient Fault-Tolerant Encoder
- Author
-
Shi, Yu, Patil, Ashlesha, and Guha, Saikat
- Subjects
Quantum Physics - Abstract
Entanglement is essential for quantum information processing but is limited by noise. We address this by developing high-yield entanglement distillation protocols with several advancements. (1) We extend the 2-to-1 recurrence entanglement distillation protocol to higher-rate n-to-(n-1) protocols that can correct any single-qubit errors. These protocols are evaluated through numerical simulations focusing on fidelity and yield. We also outline a method to adapt any classical error-correcting code for entanglement distillation, where the code can correct both bit-flip and phase-flip errors by incorporating Hadamard gates. (2) We propose a constant-depth decoder for stabilizer codes that transforms logical states into physical ones using single-qubit measurements. This decoder is applied to entanglement distillation protocols, reducing circuit depth and enabling protocols derived from advanced quantum error-correcting codes. We demonstrate this by evaluating the circuit complexity for entanglement distillation protocols based on surface codes and quantum convolutional codes. (3) Our stabilizer entanglement distillation techniques advance quantum computing. We propose a fault-tolerant protocol for constant-depth encoding and decoding of arbitrary quantum states, applicable to quantum low-density parity-check (qLDPC) codes and surface codes. This protocol is feasible with state-of-the-art reconfigurable atom arrays and surpasses the limits of conventional logarithmic depth encoders. Overall, our study integrates stabilizer formalism, measurement-based quantum computing, and entanglement distillation, advancing both quantum communication and computing., Comment: 19 pages, 7 figures
- Published
- 2024
12. LSST: Learned Single-Shot Trajectory and Reconstruction Network for MR Imaging
- Author
-
Aggarwal, Hemant Kumar, Chatterjee, Sudhanya, Shanbhag, Dattesh, Patil, Uday, and Hari, K. V. S.
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Single-shot magnetic resonance (MR) imaging acquires the entire k-space data in a single shot and it has various applications in whole-body imaging. However, the long acquisition time for the entire k-space in single-shot fast spin echo (SSFSE) MR imaging poses a challenge, as it introduces T2-blur in the acquired images. This study aims to enhance the reconstruction quality of SSFSE MR images by (a) optimizing the trajectory for measuring the k-space, (b) acquiring fewer samples to speed up the acquisition process, and (c) reducing the impact of T2-blur. The proposed method adheres to physics constraints due to maximum gradient strength and slew-rate available while optimizing the trajectory within an end-to-end learning framework. Experiments were conducted on publicly available fastMRI multichannel dataset with 8-fold and 16-fold acceleration factors. An experienced radiologist's evaluation on a five-point Likert scale indicates improvements in the reconstruction quality as the ACL fibers are sharper than comparative methods.
- Published
- 2024
13. Covariant Jacobi-Legendre expansion for total energy calculations within the projector-augmented-wave formalism
- Author
-
Focassio, Bruno, Domina, Michelangelo, Patil, Urvesh, Fazzio, Adalberto, and Sanvito, Stefano
- Subjects
Condensed Matter - Materials Science ,Condensed Matter - Strongly Correlated Electrons - Abstract
Machine-learning models can be trained to predict the converged electron charge density of a density functional theory (DFT) calculation. In general, the value of the density at a given point in space is invariant under global translations and rotations having that point as a centre. Hence, one can construct locally invariant machine-learning density predictors. However, the widely used projector augmented wave (PAW) implementation of DFT requires the evaluation of the one-center augmentation contributions, that are not rotationally invariant. Building on our recently proposed Jacobi-Legendre charge-density scheme, we construct a covariant Jacobi-Legendre model capable of predicting the local occupancies needed to compose the augmentation charge density. Our formalism is then applied to the prediction of the energy barrier for the 1H-to-1T phase transition of two-dimensional MoS$_2$. With extremely modest training, the model is capable of performing a non-self-consistent nudged elastic band calculation at virtually the same accuracy as a fully DFT-converged one, thus saving thousands of self-consistent DFT steps. Furthermore, at variance with machine-learning force fields, the charge density is here available for any nudged elastic band image, so that we can trace the evolution of the electronic structure across the phase transition., Comment: 11 pages, 3 figures
- Published
- 2024
14. Radiance Fields for Robotic Teleoperation
- Author
-
Wilder-Smith, Maximum, Patil, Vaishakh, and Hutter, Marco
- Subjects
Computer Science - Robotics - Abstract
Radiance field methods such as Neural Radiance Fields (NeRFs) or 3D Gaussian Splatting (3DGS), have revolutionized graphics and novel view synthesis. Their ability to synthesize new viewpoints with photo-realistic quality, as well as capture complex volumetric and specular scenes, makes them an ideal visualization for robotic teleoperation setups. Direct camera teleoperation provides high-fidelity operation at the cost of maneuverability, while reconstruction-based approaches offer controllable scenes with lower fidelity. With this in mind, we propose replacing the traditional reconstruction-visualization components of the robotic teleoperation pipeline with online Radiance Fields, offering highly maneuverable scenes with photorealistic quality. As such, there are three main contributions to state of the art: (1) online training of Radiance Fields using live data from multiple cameras, (2) support for a variety of radiance methods including NeRF and 3DGS, (3) visualization suite for these methods including a virtual reality scene. To enable seamless integration with existing setups, these components were tested with multiple robots in multiple configurations and were displayed using traditional tools as well as the VR headset. The results across methods and robots were compared quantitatively to a baseline of mesh reconstruction, and a user study was conducted to compare the different visualization methods. For videos and code, check out https://leggedrobotics.github.io/rffr.github.io/., Comment: 8 pages, 10 figures, Accepted to IROS 2024
- Published
- 2024
15. Minimum Time Consensus of Multi-agent System under Fuel Constraints
- Author
-
Rautela, Akansha, Patil, Deepak, Mulla, Ameer, and Kar, Indra Narayan
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
This work addresses the problem of finding a consensus point in the state space ($\mathbb{R}^2$) for a multi-agent system that is comprised of $N$ identical double integrator agents. It is assumed that each agent operates under constrained control input (i.e., $|u_i(t)| \leq 1$ $\forall i = 1, \hdots N$). Further, a fixed fuel budget is also assumed i.e., the total amount of cumulative input that can be expended is limited by $\int_0^{t_f}|u(t)|dt \le \beta$. First, the attainable set $\mathcal{A}(t,x_0,\beta)$ at time $t$, which is the set of all states that an agent can attain starting from initial conditions $x_0$ under the fuel budget constraints at time $t$ is computed for every agent. This attainable set is a convex set for all $t\ge0$. Then the minimum time to consensus is the minimum time $\bar{t}$ at which attainable sets of all agents intersect, and the consensus point is the point of intersection. A closed-form expression for the minimum time consensus point is provided for the case of three agents. Then, using Helly's theorem, the intersection will be non-empty at a time when all the $N \choose 3$ triplets of agents have non-empty intersection. The computation of minimum time consensus for all $N \choose 3$ triplets is performed independently and can be distributed among all the $N$ agents. Finally, the overall minimum time to consensus is given by the triplet that has the highest minimum time to consensus. Further, the intersection of all the attainable sets of this triplet gives the minimum time consensus point for all $N$ agents.
- Published
- 2024
16. Improving Domain Adaptation Through Class Aware Frequency Transformation
- Author
-
Kumar, Vikash, Patil, Himanshu, Lal, Rohit, and Chakraborty, Anirban
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
In this work, we explore the usage of the Frequency Transformation for reducing the domain shift between the source and target domain (e.g., synthetic image and real image respectively) towards solving the Domain Adaptation task. Most of the Unsupervised Domain Adaptation (UDA) algorithms focus on reducing the global domain shift between labelled source and unlabelled target domains by matching the marginal distributions under a small domain gap assumption. UDA performance degrades for the cases where the domain gap between source and target distribution is large. In order to bring the source and the target domains closer, we propose a novel approach based on traditional image processing technique Class Aware Frequency Transformation (CAFT) that utilizes pseudo label based class consistent low-frequency swapping for improving the overall performance of the existing UDA algorithms. The proposed approach, when compared with the state-of-the-art deep learning based methods, is computationally more efficient and can easily be plugged into any existing UDA algorithm to improve its performance. Additionally, we introduce a novel approach based on absolute difference of top-2 class prediction probabilities (ADT2P) for filtering target pseudo labels into clean and noisy sets. Samples with clean pseudo labels can be used to improve the performance of unsupervised learning algorithms. We name the overall framework as CAFT++. We evaluate the same on the top of different UDA algorithms across many public domain adaptation datasets. Our extensive experiments indicate that CAFT++ is able to achieve significant performance gains across all the popular benchmarks., Comment: Accepted at the International Journal of Computer Vision
- Published
- 2024
- Full Text
- View/download PDF
17. HDL-GPT: High-Quality HDL is All You Need
- Author
-
Kumar, Bhuvnesh, Nanda, Saurav, Parthasarathy, Ganapathy, Patil, Pawan, Tsai, Austin, and Choudhary, Parivesh
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
This paper presents Hardware Description Language Generative Pre-trained Transformers (HDL-GPT), a novel approach that leverages the vast repository of open-source High Definition Language (HDL) codes to train superior quality large code models. The core premise of this paper is the hypothesis that high-quality HDL is all you need to create models with exceptional performance and broad zero-shot generalization abilities. The paper elucidates the methods employed for the curation and augmentation of large corpora from open-source HDL code, transforming highly variable quality data into high-quality data through careful prompting and context maintenance. We demonstrate that the careful selection, filtering, and augmentation of data across HDLs can yield powerful models that surpass current state-of-the-art models. We also explore the impact of different fine-tuning methods on the quality of results. We describe experimental results across a range of fine-tuned SOTA LLMs, substantiating our claims. We demonstrate improvements of 50% to 200% over SOTA HDL models on current benchmarks in tasks ranging from HDL circuit explanations, code generation, formal and simulation testbench creation, triaging bugs, and fixing them. HDL-GPT opens new avenues for the development of advanced model training techniques for circuit design tasks., Comment: DAC 2024 Invited Paper
- Published
- 2024
18. Exploring the Design of Collaborative Applications via the Lens of NDN Workspace
- Author
-
Yu, Tianyuan, Ma, Xinyu, Patil, Varun, Kocaogullar, Yekta, and Zhang, Lixia
- Subjects
Computer Science - Networking and Internet Architecture - Abstract
Metaverse applications desire to communicate with semantically identified objects among a diverse set of cyberspace entities, such as cameras for collecting images from, sensors for sensing environment, and users collaborating with each other, all could be nearby or far away, in a timely and secure way. However, supporting the above function faces networking challenges. Today's metaverse implementations are, by and large, use secure transport connections to communicate with cloud servers instead of letting participating entities communicate directly. In this paper, we use the design and implementation of NDN Workspace, a web-based, multi-user collaborative app to showcase a new way to networking that supports many-to-many secure data exchanges among communicating entities directly. NDN Workspace users establish trust relations among each other, exchange URI-identified objects directly, and can collaborate through intermittent connectivity, all in the absence of cloud servers. Its data-centric design offers an exciting new approach to metaverse app development.
- Published
- 2024
19. Secure Web Objects: Building Blocks for Metaverse Interoperability and Decentralization
- Author
-
Yu, Tianyuan, Ma, Xinyu, Patil, Varun, Kocaogullar, Yekta, Zhang, Yulong, Burke, Jeff, Kutscher, Dirk, and Zhang, Lixia
- Subjects
Computer Science - Networking and Internet Architecture ,Computer Science - Distributed, Parallel, and Cluster Computing ,H.3.5 - Abstract
This position paper explores how to support the Web's evolution through an underlying data-centric approach that better matches the data-orientedness of modern and emerging applications. We revisit the original vision of the Web as a hypermedia system that supports document composability and application interoperability via name-based data access. We propose the use of secure web objects (SWO), a data-oriented communication approach that can reduce complexity, centrality, and inefficiency, particularly for collaborative and local-first applications, such as the Metaverse and other collaborative applications. SWO are named, signed, application-defined objects that are secured independently of their containers or communications channels, an approach that leverages the results from over a decade-long data-centric networking research. This approach does not require intermediation by aggregators of identity, storage, and other services that are common today. We present a brief design overview, illustrated through prototypes for two editors of shared hypermedia documents: one for 3D and one for LaTeX. We also discuss our findings and suggest a roadmap for future research., Comment: 9 pages
- Published
- 2024
20. SAM Fewshot Finetuning for Anatomical Segmentation in Medical Images
- Author
-
Xie, Weiyi, Willems, Nathalie, Patil, Shubham, Li, Yang, and Kumar, Mayank
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,I.4.6 ,I.5.4 ,I.5.1 - Abstract
We propose a straightforward yet highly effective few-shot fine-tuning strategy for adapting the Segment Anything (SAM) to anatomical segmentation tasks in medical images. Our novel approach revolves around reformulating the mask decoder within SAM, leveraging few-shot embeddings derived from a limited set of labeled images (few-shot collection) as prompts for querying anatomical objects captured in image embeddings. This innovative reformulation greatly reduces the need for time-consuming online user interactions for labeling volumetric images, such as exhaustively marking points and bounding boxes to provide prompts slice by slice. With our method, users can manually segment a few 2D slices offline, and the embeddings of these annotated image regions serve as effective prompts for online segmentation tasks. Our method prioritizes the efficiency of the fine-tuning process by exclusively training the mask decoder through caching mechanisms while keeping the image encoder frozen. Importantly, this approach is not limited to volumetric medical images, but can generically be applied to any 2D/3D segmentation task. To thoroughly evaluate our method, we conducted extensive validation on four datasets, covering six anatomical segmentation tasks across two modalities. Furthermore, we conducted a comparative analysis of different prompting options within SAM and the fully-supervised nnU-Net. The results demonstrate the superior performance of our method compared to SAM employing only point prompts (approximately 50% improvement in IoU) and performs on-par with fully supervised methods whilst reducing the requirement of labeled data by at least an order of magnitude., Comment: 9 pages, Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2024
- Published
- 2024
21. Crystal Growth, Terahertz Generation and Optical Characterization of Sodium Mesitylene Sulphonate (SMS)Crystal
- Author
-
Murtunge, Yamuna, Patil, Vidhyadhar, Puranik, Ruturaj, S, Jayakrishnan S, Bansal, D, Maity, Arijit, Venkatramani, Ravindra, Kulkarni, S. B., Thamizhavel, A, and Prabhu, S. S.
- Subjects
Physics - Optics - Abstract
An optically high-quality single crystal of sodium mesitylene sulfonate crystal was successfully grown by a slow evaporation method using methanol as solvent at room temperature. Single-crystal XRD has characterized the material and belongs to a monoclinic structure with a C2 space group. Functional groups were determined using Fourier-transformed infrared spectroscopy. The optical quality of the generated crystal was evaluated using UV-Vis NIR spectral analysis, which is transparent in the range of 300-1500 nm. We report the optical properties using terahertz time-domain spectroscopy (THz-TDS) and THz generation using crystal.
- Published
- 2024
22. A Methodology for Improving the Quality of the Research Article Publications in Engineering Institutions in India: A Case Study
- Author
-
Rajkumar Bhimgonda Patil, Prachi Vinod Ingle, and Padmakar A. Deshmukh
- Abstract
Research article publication is often considered a critical indicator of academic institutions' success and productivity. It improves the institution's reputation, attracts talented students and faculty members, and increases the institution's chances of receiving funding opportunities from different funding agencies. This paper provides a reliable and sustainable methodology for improving the quality and quantity of research article publications for engineering institutions in India. The various tools, techniques, and initiatives that promote research culture and improve its outcome in terms of research papers are also discussed. A case study of Pimpri Chinchwad College of Engineering (PCCOE), Pune, India, depicts how predictive, prescriptive, descriptive, and diagnostic data analytics approaches help to identify the barriers in the research article publications in academic institutions and provides the ways to overcome them. It also helps to set the publication targets and develop the path to perceive the targets. The outcomes and effectiveness of the case study are discussed using the papers published in Scopus, Web of Science, and Google Scholar databases. The challenges, opportunities, and recommendations are also provided for the smooth and effective implementation of the developed methodologies.
- Published
- 2024
- Full Text
- View/download PDF
23. A Teacher Is Worth A Million Instructions
- Author
-
Kothari, Nikhil, Nayak, Ravindra, Shetty, Shreyas, Patil, Amey, and Garera, Nikesh
- Subjects
Computer Science - Machine Learning - Abstract
Large Language Models(LLMs) have shown exceptional abilities, yet training these models can be quite challenging. There is a strong dependence on the quality of data and finding the best instruction tuning set. Further, the inherent limitations in training methods create substantial difficulties to train relatively smaller models with 7B and 13B parameters. In our research, we suggest an improved training method for these models by utilising knowledge from larger models, such as a mixture of experts (8x7B) architectures. The scale of these larger models allows them to capture a wide range of variations from data alone, making them effective teachers for smaller models. Moreover, we implement a novel post-training domain alignment phase that employs domain-specific expert models to boost domain-specific knowledge during training while preserving the model's ability to generalise. Fine-tuning Mistral 7B and 2x7B with our method surpasses the performance of state-of-the-art language models with more than 7B and 13B parameters: achieving up to $7.9$ in MT-Bench and $93.04\%$ on AlpacaEval., Comment: 7 pages, 4 figures
- Published
- 2024
24. Adaptive Deep Neural Network-Based Control Barrier Functions
- Author
-
Sweatland, Hannah M., Patil, Omkar Sudhir, and Dixon, Warren E.
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
Safety constraints of nonlinear control systems are commonly enforced through the use of control barrier functions (CBFs). Uncertainties in the dynamic model can disrupt forward invariance guarantees or cause the state to be restricted to an overly conservative subset of the safe set. In this paper, adaptive deep neural networks (DNNs) are combined with CBFs to produce a family of controllers that ensure safety while learning the system's dynamics in real-time without the requirement for pre-training. By basing the least squares adaptation law on a state derivative estimator-based identification error, the DNN parameter estimation error is shown to be uniformly ultimately bounded. The convergent bound on the parameter estimation error is then used to formulate CBF-constraints in an optimization-based controller to guarantee safety despite model uncertainty. Furthermore, the developed method is applicable for use under intermittent loss of state-feedback. Comparative simulation results demonstrate the ability of the developed method to ensure safety in an adaptive cruise control problem and when feedback is lost, unlike baseline methods., Comment: 7 pages, 2 figures, 28 references
- Published
- 2024
25. Development of Volume Produced Negative Ion Source using a CCRF Discharge
- Author
-
Singh, Pawandeep, Dahiya, Swati, Pandey, Avnish, Patil, Yashashri, and Karkari, Shantanu
- Subjects
Physics - Plasma Physics - Abstract
This work shows the development of a volume-produced negative ion source that consists of annular parallel plates driven by a 13.56 MHz capacitively coupled radio frequency in a push-pull configuration. This source shows advantages in controlling plasma conditions by varying the pressure, power, and applied axial magnetic field. It is found that the push-pull configuration allows the plasma potential to remain in the range of 20 to 40 Volts. Conversely, the application of a magnetic field helps serves to augment the production of negative ions in the central hollow part of the annular plate. Further, a plausible explanation to the obtained experimental results is presented., Comment: 5 pages, 7 figures
- Published
- 2024
26. Sheath effects with thermal electrons on the resonance frequency of a DC-biased hairpin probe
- Author
-
Singh, Pawandeep, Pandey, Avnish, Dahiya, Swati, Patil, Yashashri, Sirse, Nishant, and Karkari, Shantanu
- Subjects
Physics - Plasma Physics - Abstract
The dielectric constant of a sheath, whether ionic or electronic, formed around the cylindrical limbs of a hairpin probe, is often considered the same as that of a vacuum. However, this assumption does not hold true for electron sheaths and electron-permeating ionic sheaths, resulting in a deviation of the sheath dielectric constant from that of a vacuum. This deviation significantly influences the effective dielectric between the cylindrical limbs. As a result, it impacts the theoretically estimated resonance frequency characteristic curve of a DC-biased hairpin probe. In this study, we investigate the influence of electron temperature on the sheath dielectric and, consequently, on the resonance frequency characteristic curve. The findings shows that electron temperature primarily determines the resonance frequency characteristic curve. With increasing electron temperature, the peak in the resonance frequency characteristic curve shifts towards higher positive probe bias values and exhibits a broadening near the maxima instead of a sharp peak. This broadening near the maxima has also been validated with an experimentally measured resonance frequency characteristic curve in a capacitively coupled argon discharge., Comment: 9 pages, 13 figures
- Published
- 2024
27. Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review
- Author
-
Swami, Abhishek, Farande, Snehal, Patil, Atharv, Parle, Atharva, Mane, Vivekanand, and Thorat, Prathamesh
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,68T07 - Abstract
The most recent advances in medical imaging that have transformed diagnosis, especially in the case of interpreting X-ray images, are actively involved in the healthcare sector. The advent of digital image processing technology and the implementation of deep learning models such as Convolutional Neural Networks (CNNs) have made the analysis of X-rays much more accurate and efficient. In this article, some essential techniques such as edge detection, region-growing technique, and thresholding approach, and the deep learning models such as variants of YOLOv8-which is the best object detection and segmentation framework-are reviewed. We further investigate that the traditional image processing techniques like segmentation are very much simple and provides the alternative to the advanced methods as well. Our review gives useful knowledge on the practical usage of the innovative and traditional approaches of manual X-ray interpretation. The discovered information will help professionals and researchers to gain more profound knowledge in digital interpretation techniques in medical imaging., Comment: 29 pages, 20 figures
- Published
- 2024
- Full Text
- View/download PDF
28. Curating Stopwords in Marathi: A TF-IDF Approach for Improved Text Analysis and Information Retrieval
- Author
-
Chavan, Rohan, Patil, Gaurav, Madle, Vishal, and Joshi, Raviraj
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Stopwords are commonly used words in a language that are often considered to be of little value in determining the meaning or significance of a document. These words occur frequently in most texts and don't provide much useful information for tasks like sentiment analysis and text classification. English, which is a high-resource language, takes advantage of the availability of stopwords, whereas low-resource Indian languages like Marathi are very limited, standardized, and can be used in available packages, but the number of available words in those packages is low. Our work targets the curation of stopwords in the Marathi language using the MahaCorpus, with 24.8 million sentences. We make use of the TF-IDF approach coupled with human evaluation to curate a strong stopword list of 400 words. We apply the stop word removal to the text classification task and show its efficacy. The work also presents a simple recipe for stopword curation in a low-resource language. The stopwords are integrated into the mahaNLP library and publicly available on https://github.com/l3cube-pune/MarathiNLP ., Comment: Accepted at I2CT 2024
- Published
- 2024
- Full Text
- View/download PDF
29. Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM
- Author
-
Muddu, Sri Raghava, Rangaraju, Rupasai, Siledar, Tejpalsingh, Nath, Swaroop, Bhattacharyya, Pushpak, Nath, Swaprava, Banerjee, Suman, Patil, Amey, Chelliah, Muthusamy, Singh, Sudhanshu Shekhar, and Garera, Nikesh
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While Large Language Models (LLMs) have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limitations. To mitigate, we propose a scalable framework called Xl-OpSumm that generates summaries incrementally. However, the existing test set, AMASUM has only 560 reviews per product on average. Due to the lack of a test set with thousands of reviews, we created a new test set called Xl-Flipkart by gathering data from the Flipkart website and generating summaries using GPT-4. Through various automatic evaluations and extensive analysis, we evaluated the framework's efficiency on two datasets, AMASUM and Xl-Flipkart. Experimental results show that our framework, Xl-OpSumm powered by Llama-3-8B-8k, achieves an average ROUGE-1 F1 gain of 4.38% and a ROUGE-L F1 gain of 3.70% over the next best-performing model.
- Published
- 2024
30. A Framework for Efficient Model Evaluation through Stratification, Sampling, and Estimation
- Author
-
Fogliato, Riccardo, Patil, Pratik, Monfort, Mathew, and Perona, Pietro
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Statistics - Applications - Abstract
Model performance evaluation is a critical and expensive task in machine learning and computer vision. Without clear guidelines, practitioners often estimate model accuracy using a one-time completely random selection of the data. However, by employing tailored sampling and estimation strategies, one can obtain more precise estimates and reduce annotation costs. In this paper, we propose a statistical framework for model evaluation that includes stratification, sampling, and estimation components. We examine the statistical properties of each component and evaluate their efficiency (precision). One key result of our work is that stratification via k-means clustering based on accurate predictions of model performance yields efficient estimators. Our experiments on computer vision datasets show that this method consistently provides more precise accuracy estimates than the traditional simple random sampling, even with substantial efficiency gains of 10x. We also find that model-assisted estimators, which leverage predictions of model accuracy on the unlabeled portion of the dataset, are generally more efficient than the traditional estimates based solely on the labeled data., Comment: To appear at ECCV 2024
- Published
- 2024
31. Model fusion for efficient learning of nonlinear dynamical systems
- Author
-
Kedia, Vatsal, Pinnamaraju, Vivek S., and Patil, Dinesh
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
In the context of model-based control of industrial processes, it is a common practice to develop a data-driven linear dynamical model around a specified operating point. However, in applications involving wider operating conditions, representation of the dynamics using a single linear dynamic model is often inadequate, requiring either a nonlinear model or multiple linear models to accommodate the nonlinear behaviour. While the development of the former suffers from the requirements of extensive experiments spanning multiple levels, significant compromise in the nominal product quality and dealing with unmeasured disturbances over wider operating conditions, the latter faces the challenge of model switch scheduling and inadequate description of dynamics for the operating regions in-between. To overcome these challenges, we propose an efficient approach to obtain a parsimonious nonlinear dynamic model by developing multiple linear models from data at multiple operating points, lifting the data features obtained from individual model simulations to adequately accommodate the underlying nonlinear behaviour and finally, sparse optimization techniques to obtain a parsimonious model. The performance and effectiveness of the proposed algorithm is demonstrated through simulation case studies.
- Published
- 2024
32. Sloshing and spiral structures breeding a putative radio mini-halo in the environment of a cool-core cluster Abell 795
- Author
-
Kadam, S. K., Salunkhe, Sameer, Vagshette, N. D., Paul, Surajit, Sonkamble, Satish S., Pawar, P. K., and Patil, M. K.
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
Spiral structures and cold fronts in X-rays are frequently observed in cool core galaxy clusters. However, studies on radio mini-haloes associated with such spirals and their physical connections are rare. Here, we present the detection of an extended diffuse radio emission entrained in the X-ray spiral structure in a known cool core cluster Abell 795 (A795). Though the cool core is a sign of the relaxed nature of the clusters, our re-analysed 30 ks Chandra X-ray data of cluster A795 confirms the presence of an interesting log spiral structure of X-ray deficit region complemented by an X-ray excess counter spiral in the residual map, exposing its dynamical activity. Our new analysis of 150 $\&$ 325 MHz GMRT archival data of the cluster confirms the detection of a $\sim180$ kpc ultra-steep ($\alpha\sim-2.7$) diffuse radio structure which was previously reported as a candidate radio mini halo from low sensitive survey maps. This radio emission spans the entire spiral structure ($\sim186$ kpc), enclosed by two previously reported cold fronts. Furthermore, SDSS DR13 optical spectra, as well as GALEX's FUV data, show a considerably low total star formation rate of 2.52 M$_{\odot}$ yr$^{-1}$ and having no significant variation in metallicity distribution. We argued that the two-phase (hot and cold) plasma at the cluster core with differential velocity has probably caused the spiral formation and has redistributed the secondary electrons from the central BCG or the pre-accelerated electrons which have been (re-)accelerated by the sloshing turbulence to form the observed candidate radio mini-halo structure. This has been supported by a few previous studies that indicate spiral formation and sloshing turbulence may quench star formation and facilitate smooth metallicity distribution by mixing the gas in the core., Comment: 11 pages, 9 figures, Accepted for publication in MNRAS
- Published
- 2024
- Full Text
- View/download PDF
33. Post-Minkowskian Theory Meets the Spinning Effective-One-Body Approach for Bound-Orbit Waveforms
- Author
-
Buonanno, Alessandra, Mogull, Gustav, Patil, Raj, and Pompili, Lorenzo
- Subjects
General Relativity and Quantum Cosmology ,High Energy Physics - Theory - Abstract
Driven by advances in scattering amplitudes and worldline-based methods, recent years have seen significant progress in our ability to calculate gravitational two-body scattering observables. These observables effectively encapsulate the gravitational two-body problem in the weak-field and high-velocity regime (post-Minkowskian, PM), with applications to the bound two-body problem and gravitational-wave modeling. We leverage PM data to construct a complete inspiral-merger-ringdown waveform model for non-precessing spinning black holes within the effective-one-body (EOB) formalism: SEOBNR-PM. This model is closely based on the highly successful SEOBNRv5 model, used by the LIGO-Virgo-KAGRA Collaboration, with its key new feature being an EOB Hamiltonian derived by matching the two-body scattering angle in a perturbative PM expansion. The model performs remarkably well, showing a median mismatch against 441 numerical-relativity (NR) simulations that is somewhat lower than a similarly calibrated version of SEOBNRv5. Comparisons of the binding energy with NR also demonstrate better agreement than SEOBNRv5, despite the latter containing additional calibration to NR simulations., Comment: 5 pages, 4 figures; supplemental material; attached ancillary Mathematica file
- Published
- 2024
34. The 2024 Brain Tumor Segmentation (BraTS) Challenge: Glioma Segmentation on Post-treatment MRI
- Author
-
de Verdier, Maria Correia, Saluja, Rachit, Gagnon, Louis, LaBella, Dominic, Baid, Ujjwall, Tahon, Nourel Hoda, Foltyn-Dumitru, Martha, Zhang, Jikai, Alafif, Maram, Baig, Saif, Chang, Ken, D'Anna, Gennaro, Deptula, Lisa, Gupta, Diviya, Haider, Muhammad Ammar, Hussain, Ali, Iv, Michael, Kontzialis, Marinos, Manning, Paul, Moodi, Farzan, Nunes, Teresa, Simon, Aaron, Sollmann, Nico, Vu, David, Adewole, Maruf, Albrecht, Jake, Anazodo, Udunna, Chai, Rongrong, Chung, Verena, Faghani, Shahriar, Farahani, Keyvan, Kazerooni, Anahita Fathi, Iglesias, Eugenio, Kofler, Florian, Li, Hongwei, Linguraru, Marius George, Menze, Bjoern, Moawad, Ahmed W., Velichko, Yury, Wiestler, Benedikt, Altes, Talissa, Basavasagar, Patil, Bendszus, Martin, Brugnara, Gianluca, Cho, Jaeyoung, Dhemesh, Yaseen, Fields, Brandon K. K., Garrett, Filip, Gass, Jaime, Hadjiiski, Lubomir, Hattangadi-Gluth, Jona, Hess, Christopher, Houk, Jessica L., Isufi, Edvin, Layfield, Lester J., Mastorakos, George, Mongan, John, Nedelec, Pierre, Nguyen, Uyen, Oliva, Sebastian, Pease, Matthew W., Rastogi, Aditya, Sinclair, Jason, Smith, Robert X., Sugrue, Leo P., Thacker, Jonathan, Vidic, Igor, Villanueva-Meyer, Javier, White, Nathan S., Aboian, Mariam, Conte, Gian Marco, Dale, Anders, Sabuncu, Mert R., Seibert, Tyler M., Weinberg, Brent, Abayazeed, Aly, Huang, Raymond, Turk, Sevcan, Rauschecker, Andreas M., Farid, Nikdokht, Vollmuth, Philipp, Nada, Ayman, Bakas, Spyridon, Calabrese, Evan, and Rudie, Jeffrey D.
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Gliomas are the most common malignant primary brain tumors in adults and one of the deadliest types of cancer. There are many challenges in treatment and monitoring due to the genetic diversity and high intrinsic heterogeneity in appearance, shape, histology, and treatment response. Treatments include surgery, radiation, and systemic therapies, with magnetic resonance imaging (MRI) playing a key role in treatment planning and post-treatment longitudinal assessment. The 2024 Brain Tumor Segmentation (BraTS) challenge on post-treatment glioma MRI will provide a community standard and benchmark for state-of-the-art automated segmentation models based on the largest expert-annotated post-treatment glioma MRI dataset. Challenge competitors will develop automated segmentation models to predict four distinct tumor sub-regions consisting of enhancing tissue (ET), surrounding non-enhancing T2/fluid-attenuated inversion recovery (FLAIR) hyperintensity (SNFH), non-enhancing tumor core (NETC), and resection cavity (RC). Models will be evaluated on separate validation and test datasets using standardized performance metrics utilized across the BraTS 2024 cluster of challenges, including lesion-wise Dice Similarity Coefficient and Hausdorff Distance. Models developed during this challenge will advance the field of automated MRI segmentation and contribute to their integration into clinical practice, ultimately enhancing patient care., Comment: 10 pages, 4 figures, 1 table
- Published
- 2024
35. Cross-Language Assessment of Mathematical Capability of ChatGPT
- Author
-
Sathe, Gargi, Shamraj, Aneesh, Surve, Aditya, Patil, Nahush, and Saxena, Kumkum
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
This paper presents an evaluation of the mathematical capability of ChatGPT across diverse languages like Hindi, Gujarati, and Marathi. ChatGPT, based on GPT-3.5 by OpenAI, has garnered significant attention for its natural language understanding and generation abilities. However, its performance in solving mathematical problems across multiple natural languages remains a comparatively unexplored area, especially in regional Indian languages. In this paper, we explore those capabilities as well as using chain-of-thought prompting to figure out if it increases the accuracy of responses as much as it does in the English language and provide insights into the current limitations.
- Published
- 2024
36. Sloshing Cold Fronts in Galaxy Cluster Abell 2566
- Author
-
Kadam, S. K., Sonkamble, Satish S., Vagshette, N. D., and Patil, M. K.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Astrophysics of Galaxies - Abstract
This paper presents properties of the intracluster medium (ICM) in the environment of a cool core cluster Abell 2566 (redshift $z$ = 0.08247) based on the analysis of 20 ks Chandra X-ray data. 2D imaging analysis of the Chandra data from this cluster revealed spiral structures in the morphology of X-ray emission from within the central 109 kpc formed due to gas sloshing. This analysis also witness sharp edges in the surface brightness distribution along the south-east and north-west of the X-ray peaks at 41.6 kpc and 77.4 kpc, respectively. Spectral analysis of 0.5 - 7 keV X-ray photons along these discontinuities exhibited sharp temperature jumps from 2.3 to 3.1 keV and 1.8 to 2.8 keV, respectively, with consistency in the pressure profiles, implying their association with cold fronts due to gas sloshing of the gas. Further confirmation for such an association was provided by the deprojected broken power-law density function fit to the surface brightness distribution along these wedge shaped sectorial regions. This study also witness an offset of 4.6 arcsec (6.8 kpc) between the BCG and the X-ray peak, and interaction of the BCG with a sub-system in the central region, pointing towards the origin of the spiral structure due to a minor merger., Comment: 10 pages, 7 figures, Accepted for New Astronomy
- Published
- 2024
- Full Text
- View/download PDF
37. CaFA: Global Weather Forecasting with Factorized Attention on Sphere
- Author
-
Li, Zijie, Zhou, Anthony, Patil, Saurabh, and Farimani, Amir Barati
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computational Engineering, Finance, and Science - Abstract
Accurate weather forecasting is crucial in various sectors, impacting decision-making processes and societal events. Data-driven approaches based on machine learning models have recently emerged as a promising alternative to numerical weather prediction models given their potential to capture physics of different scales from historical data and the significantly lower computational cost during the prediction stage. Renowned for its state-of-the-art performance across diverse domains, the Transformer model has also gained popularity in machine learning weather prediction. Yet applying Transformer architectures to weather forecasting, particularly on a global scale is computationally challenging due to the quadratic complexity of attention and the quadratic increase in spatial points as resolution increases. In this work, we propose a factorized-attention-based model tailored for spherical geometries to mitigate this issue. More specifically, it utilizes multi-dimensional factorized kernels that convolve over different axes where the computational complexity of the kernel is only quadratic to the axial resolution instead of overall resolution. The deterministic forecasting accuracy of the proposed model on $1.5^\circ$ and 0-7 days' lead time is on par with state-of-the-art purely data-driven machine learning weather prediction models. We also showcase the proposed model holds great potential to push forward the Pareto front of accuracy-efficiency for Transformer weather models, where it can achieve better accuracy with less computational cost compared to Transformer based models with standard attention., Comment: Preprint
- Published
- 2024
38. An improved version of Kac's Central Limit Theorem
- Author
-
Bhar, Suprio, Mukherjee, Ritwik, and Patil, Prathmesh
- Subjects
Mathematics - Probability ,60F05, 37A99 - Abstract
The classical Central Limit Theorem (CLT) states that for a sequence of independent and identically distributed (i.i.d) random variables with finite mean and variance, the normalized sample mean converges to the standard normal distribution. In $1946$, Victor Kac proved a Central Limit type theorem for a sequence of random variables that were not independent. The random variables under consideration were obtained from the angle-doubling map. The idea behind Kac's proof was to show that although the random variables under consideration were not independent, they were what he calls \textit{statistically independent} (in modern terminology, this concept is called long range independence). The final conclusion of his paper was that the sample averages of the random variables, suitably normalized converges to the standard normal distribution. In the 1970's, Charles Stein revolutionized the field of probability by discovering a new method to obtain the limiting distribution for a sequence of random variables. Among other things, his method gave an alternative proof of the classical Central Limit Theorem. We obtain an improvement of Victor Kac's result by applying Stein's method. We show that the normalized sample averages converge to the standard normal distribution in the Wasserstein metric, which is stronger than the convergence in distribution.
- Published
- 2024
39. Cool-core, X-ray cavities and cold front revealed in RXCJ0352.9+1941 cluster by Chandra and GMRT observations
- Author
-
Sonkamble, Satish S., Kadam, S. K., Paul, Surajit, Pandge, M. B., Pawar, P. K., and Patil, M. K.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Astrophysics of Galaxies - Abstract
This paper presents a comprehensive analysis of 30 ks Chandra and 46.8 ks (13 Hr) 1.4 GHz GMRT radio data on the cool-core cluster RXCJ0352.9+1941 with an objective to investigate AGN activities at its core. This study confirms a pair of X-ray cavities at projected distances of about 10.30 kpc and 20.80 kpc, respectively, on the NW and SE of the X-ray peak. GMRT L band (1.4 GHz) data revealed a bright radio source associated with the core of this cluster hosting multiple jet-like emissions. The spatial association of the X-ray cavities with the inner pair of radio jets confirm their origin due to AGN outbursts. The 1.4 GHz radio power ${\rm 7.4 \pm 0.8 \times 10^{39} \, erg\, s^{-1}}$ is correlated with the mechanical power stored in the X-ray cavities ($\sim7.90\times 10^{44}$ erg s$^{-1}$), implying that the power injected by radio jets in the ICM is sufficient enough to offset the radiative losses. The X-shaped morphology of diffuse radio emission seems to be comprised of two pairs of orthogonal radio jets, likely formed due to a spin-flip of jets due to the merger of two systems. The X-ray surface brightness analysis of the ICM in its environment revealed two non-uniform, extended spiral-like emission structures on either side of the core, pointing towards the sloshing of gas due to a minor merger and might have resulted in a cold front at $\sim$31 arcsec (62 kpc) with a temperature jump of 1.44 keV., Comment: 18 pages, 10 figures, Accepted for Journal of Astrophysics and Astronomy
- Published
- 2024
- Full Text
- View/download PDF
40. NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results
- Author
-
Liu, Xiaoning, Wu, Zongwei, Li, Ao, Vasluianu, Florin-Alexandru, Zhang, Yulun, Gu, Shuhang, Zhang, Le, Zhu, Ce, Timofte, Radu, Jin, Zhi, Wu, Hongjun, Wang, Chenxi, Ling, Haitao, Cai, Yuanhao, Bian, Hao, Zheng, Yuxin, Lin, Jing, Yuille, Alan, Shao, Ben, Guo, Jin, Liu, Tianli, Wu, Mohao, Feng, Yixu, Hou, Shuo, Lin, Haotian, Zhu, Yu, Wu, Peng, Dong, Wei, Sun, Jinqiu, Zhang, Yanning, Yan, Qingsen, Zou, Wenbin, Yang, Weipeng, Li, Yunxiang, Wei, Qiaomu, Ye, Tian, Chen, Sixiang, Zhang, Zhao, Zhao, Suiyi, Wang, Bo, Luo, Yan, Zuo, Zhichao, Wang, Mingshen, Wang, Junhu, Wei, Yanyan, Sun, Xiaopeng, Gao, Yu, Huang, Jiancheng, Chen, Hongming, Chen, Xiang, Tang, Hui, Chen, Yuanbin, Zhou, Yuanbo, Dai, Xinwei, Qiu, Xintao, Deng, Wei, Gao, Qinquan, Tong, Tong, Li, Mingjia, Hu, Jin, He, Xinyu, Guo, Xiaojie, Sabarinathan, Uma, K, Sasithradevi, A, Bama, B Sathya, Roomi, S. Mohamed Mansoor, Srivatsav, V., Wang, Jinjuan, Sun, Long, Chen, Qiuying, Shao, Jiahong, Zhang, Yizhi, Conde, Marcos V., Feijoo, Daniel, Benito, Juan C., García, Alvaro, Lee, Jaeho, Kim, Seongwan, A, Sharif S M, Khujaev, Nodirkhuja, Tsoy, Roman, Murtaza, Ali, Khairuddin, Uswah, Faudzi, Ahmad 'Athif Mohd, Malagi, Sampada, Joshi, Amogh, Akalwadi, Nikhil, Desai, Chaitra, Tabib, Ramesh Ashok, Mudenagudi, Uma, Lian, Wenyi, Lian, Wenjing, Kalyanshetti, Jagadeesh, Aralikatti, Vijayalaxmi Ashok, Yashaswini, Palani, Upasi, Nitish, Hegde, Dikshit, Patil, Ujwala, C, Sujata, Yan, Xingzhuo, Hao, Wei, Fu, Minghan, choksy, Pooja, Sarvaiya, Anjali, Upla, Kishor, Raja, Kiran, Yan, Hailong, Zhang, Yunkai, Li, Baiang, Zhang, Jingyi, and Zheng, Huan
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper reviews the NTIRE 2024 low light image enhancement challenge, highlighting the proposed solutions and results. The aim of this challenge is to discover an effective network design or solution capable of generating brighter, clearer, and visually appealing results when dealing with a variety of conditions, including ultra-high resolution (4K and beyond), non-uniform illumination, backlighting, extreme darkness, and night scenes. A notable total of 428 participants registered for the challenge, with 22 teams ultimately making valid submissions. This paper meticulously evaluates the state-of-the-art advancements in enhancing low-light images, reflecting the significant progress and creativity in this field., Comment: NTIRE 2024 Challenge Report
- Published
- 2024
41. Mitigating Data Sharing in Public Cloud using Blockchain
- Author
-
Vijaykumar, Patil Pratik, Tulsiani, Prerna, and Mane, Sunil
- Subjects
Computer Science - Cryptography and Security - Abstract
Public Cloud Computing has become a fundamental part of modern IT infrastructure as its adoption has transformed the way businesses operate. However, cloud security concerns introduce new risks and challenges related to data protection, sharing, and access control. A synergistic integration of blockchain with the cloud holds immense potential. Blockchain's distributed ledger ensures transparency, immutability, and efficiency as it reduces the reliance on centralized authorities. Motivated by this, our framework proposes a secure data ecosystem in the cloud with the key aspects being Data Rights, Data Sharing, and Data Validation. Also, this approach aims to increase its interoperability and scalability by eliminating the need for data migration. This will ensure that existing public cloud-based systems can easily deploy blockchain enhancing trustworthiness and non-repudiation of cloud data.
- Published
- 2024
42. MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering
- Author
-
Ghatage, Ruturaj, Kulkarni, Aditya, Patil, Rajlaxmi, Endait, Sharvi, and Joshi, Raviraj
- Subjects
Computer Science - Computation and Language ,Computer Science - Machine Learning - Abstract
Question-answering systems have revolutionized information retrieval, but linguistic and cultural boundaries limit their widespread accessibility. This research endeavors to bridge the gap of the absence of efficient QnA datasets in low-resource languages by translating the English Question Answering Dataset (SQuAD) using a robust data curation approach. We introduce MahaSQuAD, the first-ever full SQuAD dataset for the Indic language Marathi, consisting of 118,516 training, 11,873 validation, and 11,803 test samples. We also present a gold test set of manually verified 500 examples. Challenges in maintaining context and handling linguistic nuances are addressed, ensuring accurate translations. Moreover, as a QnA dataset cannot be simply converted into any low-resource language using translation, we need a robust method to map the answer translation to its span in the translated passage. Hence, to address this challenge, we also present a generic approach for translating SQuAD into any low-resource language. Thus, we offer a scalable approach to bridge linguistic and cultural gaps present in low-resource languages, in the realm of question-answering systems. The datasets and models are shared publicly at https://github.com/l3cube-pune/MarathiNLP ., Comment: Accepted at the International Conference on Natural Language Processing (ICON 2023)
- Published
- 2024
43. The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report
- Author
-
Ren, Bin, Li, Yawei, Mehta, Nancy, Timofte, Radu, Yu, Hongyuan, Wan, Cheng, Hong, Yuxin, Han, Bingnan, Wu, Zhuoyuan, Zou, Yajun, Liu, Yuqing, Li, Jizhe, He, Keji, Fan, Chao, Zhang, Heng, Zhang, Xiaolin, Yin, Xuanwu, Zuo, Kunlong, Liao, Bohao, Xia, Peizhe, Peng, Long, Du, Zhibo, Di, Xin, Li, Wangkai, Wang, Yang, Zhai, Wei, Pei, Renjing, Guo, Jiaming, Xu, Songcen, Cao, Yang, Zha, Zhengjun, Wang, Yan, Liu, Yi, Wang, Qing, Zhang, Gang, Zhang, Liou, Zhao, Shijie, Sun, Long, Pan, Jinshan, Dong, Jiangxin, Tang, Jinhui, Liu, Xin, Yan, Min, Wang, Qian, Zhou, Menghan, Yan, Yiqiang, Liu, Yixuan, Chan, Wensong, Tang, Dehua, Zhou, Dong, Wang, Li, Tian, Lu, Emad, Barsoum, Jia, Bohan, Qiao, Junbo, Zhou, Yunshuai, Zhang, Yun, Li, Wei, Lin, Shaohui, Zhou, Shenglong, Chen, Binbin, Liao, Jincheng, Zhao, Suiyi, Zhang, Zhao, Wang, Bo, Luo, Yan, Wei, Yanyan, Li, Feng, Wang, Mingshen, Guan, Jinhan, Hu, Dehua, Yu, Jiawei, Xu, Qisheng, Sun, Tao, Lan, Long, Xu, Kele, Lin, Xin, Yue, Jingtong, Yang, Lehan, Du, Shiyi, Qi, Lu, Ren, Chao, Han, Zeyu, Wang, Yuhan, Chen, Chaolin, Li, Haobo, Zheng, Mingjun, Yang, Zhongbao, Song, Lianhong, Yan, Xingzhuo, Fu, Minghan, Zhang, Jingyi, Li, Baiang, Zhu, Qi, Xu, Xiaogang, Guo, Dan, Guo, Chunle, Chen, Jiadi, Long, Huanhuan, Duanmu, Chunjiang, Lei, Xiaoyan, Liu, Jie, Jia, Weilin, Cao, Weifeng, Zhang, Wenlong, Mao, Yanyu, Guo, Ruilong, Zhang, Nihao, Pandey, Manoj, Chernozhukov, Maksym, Le, Giang, Cheng, Shuli, Wang, Hongyuan, Wei, Ziyan, Tang, Qingting, Wang, Liejun, Li, Yongming, Guo, Yanhui, Xu, Hao, Khatami-Rizi, Akram, Mahmoudi-Aznaveh, Ahmad, Hsu, Chih-Chung, Lee, Chia-Ming, Chou, Yi-Shiuan, Joshi, Amogh, Akalwadi, Nikhil, Malagi, Sampada, Yashaswini, Palani, Desai, Chaitra, Tabib, Ramesh Ashok, Patil, Ujwala, and Mudenagudi, Uma
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Electrical Engineering and Systems Science - Image and Video Processing - Abstract
This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such as runtime, parameters, and FLOPs, while still maintaining a peak signal-to-noise ratio (PSNR) of approximately 26.90 dB on the DIV2K_LSDIR_valid dataset and 26.99 dB on the DIV2K_LSDIR_test dataset. In addition, this challenge has 4 tracks including the main track (overall performance), sub-track 1 (runtime), sub-track 2 (FLOPs), and sub-track 3 (parameters). In the main track, all three metrics (ie runtime, FLOPs, and parameter count) were considered. The ranking of the main track is calculated based on a weighted sum-up of the scores of all other sub-tracks. In sub-track 1, the practical runtime performance of the submissions was evaluated, and the corresponding score was used to determine the ranking. In sub-track 2, the number of FLOPs was considered. The score calculated based on the corresponding FLOPs was used to determine the ranking. In sub-track 3, the number of parameters was considered. The score calculated based on the corresponding parameters was used to determine the ranking. RLFN is set as the baseline for efficiency measurement. The challenge had 262 registered participants, and 34 teams made valid submissions. They gauge the state-of-the-art in efficient single-image super-resolution. To facilitate the reproducibility of the challenge and enable other researchers to build upon these findings, the code and the pre-trained model of validated solutions are made publicly available at https://github.com/Amazingren/NTIRE2024_ESR/., Comment: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024
- Published
- 2024
44. NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results
- Author
-
Chen, Zheng, Wu, Zongwei, Zamfir, Eduard, Zhang, Kai, Zhang, Yulun, Timofte, Radu, Yang, Xiaokang, Yu, Hongyuan, Wan, Cheng, Hong, Yuxin, Huang, Zhijuan, Zou, Yajun, Huang, Yuan, Lin, Jiamin, Han, Bingnan, Guan, Xianyu, Yu, Yongsheng, Zhang, Daoan, Yin, Xuanwu, Zuo, Kunlong, Hao, Jinhua, Zhao, Kai, Yuan, Kun, Sun, Ming, Zhou, Chao, An, Hongyu, Zhang, Xinfeng, Song, Zhiyuan, Dong, Ziyue, Zhao, Qing, Xu, Xiaogang, Wei, Pengxu, Dou, Zhi-chao, Wang, Gui-ling, Hsu, Chih-Chung, Lee, Chia-Ming, Chou, Yi-Shiuan, Korkmaz, Cansu, Tekalp, A. Murat, Wei, Yubin, Yan, Xiaole, Li, Binren, Chen, Haonan, Zhang, Siqi, Chen, Sihan, Joshi, Amogh, Akalwadi, Nikhil, Malagi, Sampada, Yashaswini, Palani, Desai, Chaitra, Tabib, Ramesh Ashok, Patil, Ujwala, Mudenagudi, Uma, Sarvaiya, Anjali, Choksy, Pooja, Joshi, Jagrit, Kawa, Shubh, Upla, Kishor, Patwardhan, Sushrut, Ramachandra, Raghavendra, Hossain, Sadat, Park, Geongi, Uddin, S. M. Nadim, Xu, Hao, Guo, Yanhui, Urumbekov, Aman, Yan, Xingzhuo, Hao, Wei, Fu, Minghan, Orais, Isaac, Smith, Samuel, Liu, Ying, Jia, Wangwang, Xu, Qisheng, Xu, Kele, Yuan, Weijun, Li, Zhan, Kuang, Wenqin, Guan, Ruijin, Deng, Ruting, Zhang, Zhao, Wang, Bo, Zhao, Suiyi, Luo, Yan, Wei, Yanyan, Khan, Asif Hussain, Micheloni, Christian, and Martinel, Niki
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge is to obtain designs/solutions with the most advanced SR performance, with no constraints on computational resources (e.g., model size and FLOPs) or training data. The track of this challenge assesses performance with the PSNR metric on the DIV2K testing dataset. The competition attracted 199 registrants, with 20 teams submitting valid entries. This collective endeavour not only pushes the boundaries of performance in single-image SR but also offers a comprehensive overview of current trends in this field., Comment: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4
- Published
- 2024
45. LLoCO: Learning Long Contexts Offline
- Author
-
Tan, Sijun, Li, Xiuyu, Patil, Shishir, Wu, Ziyang, Zhang, Tianjun, Keutzer, Kurt, Gonzalez, Joseph E., and Popa, Raluca Ada
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Processing long contexts remains a challenge for large language models (LLMs) due to the quadratic computational and memory overhead of the self-attention mechanism and the substantial KV cache sizes during generation. We propose a novel approach to address this problem by learning contexts offline through context compression and in-domain parameter-efficient finetuning. Our method enables an LLM to create a concise representation of the original context and efficiently retrieve relevant information to answer questions accurately. We introduce LLoCO, a technique that combines context compression, retrieval, and parameter-efficient finetuning using LoRA. Our approach extends the effective context window of a 4k token LLaMA2-7B model to handle up to 128k tokens. We evaluate our approach on several long-context question-answering datasets, demonstrating that LLoCO significantly outperforms in-context learning while using $30\times$ fewer tokens during inference. LLoCO achieves up to $7.62\times$ speed-up and substantially reduces the cost of long document question answering, making it a promising solution for efficient long context processing. Our code is publicly available at https://github.com/jeffreysijuntan/lloco., Comment: The first two authors contributed equally to this work
- Published
- 2024
46. A new approach to construct minimal linear codes over $\mathbb{F}_{3}$
- Author
-
Shaikh, Wajid M., Jain, Rupali S., Reddy, B. Surendranath, Patil, Bhagyashri S., and Maqbol, Sahar M. A.
- Subjects
Computer Science - Information Theory ,Mathematics - Rings and Algebras - Abstract
In this article, we present two new approaches to construct minimal linear codes of dimension $n+1$ over $\mathbb{F}_{3}$ using characteristic and ternary functions. We also obtain the weight distributions of these constructed minimal linear codes. We further show that a specific class of these codes violates Ashikhmin-Barg condition.
- Published
- 2024
47. Product Description and QA Assisted Self-Supervised Opinion Summarization
- Author
-
Siledar, Tejpalsingh, Rangaraju, Rupasai, Muddu, Sankara Sri Raghava Ravindra, Banerjee, Suman, Patil, Amey, Singh, Sudhanshu Shekhar, Chelliah, Muthusamy, Garera, Nikesh, Nath, Swaprava, and Bhattacharyya, Pushpak
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
In e-commerce, opinion summarization is the process of summarizing the consensus opinions found in product reviews. However, the potential of additional sources such as product description and question-answers (QA) has been considered less often. Moreover, the absence of any supervised training data makes this task challenging. To address this, we propose a novel synthetic dataset creation (SDC) strategy that leverages information from reviews as well as additional sources for selecting one of the reviews as a pseudo-summary to enable supervised training. Our Multi-Encoder Decoder framework for Opinion Summarization (MEDOS) employs a separate encoder for each source, enabling effective selection of information while generating the summary. For evaluation, due to the unavailability of test sets with additional sources, we extend the Amazon, Oposum+, and Flipkart test sets and leverage ChatGPT to annotate summaries. Experiments across nine test sets demonstrate that the combination of our SDC approach and MEDOS model achieves on average a 14.5% improvement in ROUGE-1 F1 over the SOTA. Moreover, comparative analysis underlines the significance of incorporating additional sources for generating more informative summaries. Human evaluations further indicate that MEDOS scores relatively higher in coherence and fluency with 0.41 and 0.5 (-1 to 1) respectively, compared to existing models. To the best of our knowledge, we are the first to generate opinion summaries leveraging additional sources in a self-supervised setting.
- Published
- 2024
48. UDON: A case for offloading to general purpose compute on CXL memory
- Author
-
Hermes, Jon, Minor, Josh, Wu, Minjun, Patil, Adarsh, and Van Hensbergen, Eric
- Subjects
Computer Science - Emerging Technologies - Abstract
Upcoming CXL-based disaggregated memory devices feature special purpose units to offload compute to near-memory. In this paper, we explore opportunities for offloading compute to general purpose cores on CXL memory devices, thereby enabling a greater utility and diversity of offload. We study two classes of popular memory intensive applications: ML inference and vector database as candidates for computational offload. The study uses Arm AArch64-based dual-socket NUMA systems to emulate CXL type-2 devices. Our study shows promising results. With our ML inference model partitioning strategy for compute offload, we can place up to 90% data in remote memory with just 20% performance trade-off. Offloading Hierarchical Navigable Small World (HNSW) kernels in vector databases can provide upto 6.87$\times$ performance improvement with under 10% offload overhead., Comment: Presented at the 3rd Workshop on Heterogeneous Composable and Disaggregated Systems (HCDS 2024)
- Published
- 2024
49. Outcomes of underwater endoscopic mucosal resection for colorectal polyps—Insights from western India
- Author
-
Sundaram, Sridhar, Patil, Gaurav Kumar, Jain, Aadish Kumar, Dalal, Ankit, Patil, Prachi, Mehta, Shaesta, and Maydeo, Amit
- Published
- 2024
- Full Text
- View/download PDF
50. A Comparative Analysis of Plant Canopy Detection Performance in a Variable-Rate Spraying System Using Deep Learning Models
- Author
-
Patil, Seema Suhas, Patil, Yuvaraj Mahadev, Patil, Suhas Bapuso, and Powar, Ranjit Vasant
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.