39,912 results on '"Suresh, P."'
Search Results
2. Optimal Box Contraction for Solving Linear Systems via Simulated and Quantum Annealing
- Author
-
Suresh, Sanjay and Suresh, Krishnan
- Subjects
Computer Science - Computational Engineering, Finance, and Science - Abstract
Solving linear systems of equations is an important problem in science and engineering. Many quantum algorithms, such as the Harrow-Hassidim-Lloyd (HHL) algorithm (for quantum-gate computers) and the box algorithm (for quantum-annealing machines), have been proposed for solving such systems. The focus of this paper is on improving the efficiency of the box algorithm. The basic principle behind this algorithm is to transform the linear system into a series of quadratic unconstrained binary optimization (QUBO) problems, which are then solved on annealing machines. The computational efficiency of the box algorithm is entirely determined by the number of iterations, which, in turn, depends on the box contraction ratio, typically set to 0.5. Here, we show through theory that a contraction ratio of 0.5 is sub-optimal and that we can achieve a speed-up with a contraction ratio of 0.2. This is confirmed through numerical experiments where a speed-up between $20 \%$ to $60 \%$ is observed when the optimal contraction ratio is used.
- Published
- 2024
3. This title is unavailable for guests, please login to see more information.
- Published
- 2024
4. Physical and chemical methods of extraction of bioactive molecules from Lepidium sativum linn. and antioxidant activity-based screening and selection of extracts-probable phytochemical, chromatography and mass spectroscopy analysis-based correlates
- Author
-
Rajasekaran, R. and Suresh, P. K.
- Published
- 2021
- Full Text
- View/download PDF
5. Determination of elemental impurities of Arsenic, Cadmium, Mercury, Lead and Palladium content in Testosterone propionate by using ICP-MS
- Author
-
Suresh, P. and Kumar, Konda Ravi
- Published
- 2021
- Full Text
- View/download PDF
6. Deep Attention Driven Reinforcement Learning (DAD-RL) for Autonomous Vehicle Decision-Making in Dynamic Environment
- Author
-
Chowdhury, Jayabrata, Shivaraman, Venkataramanan, Dangi, Sumit, Sundaram, Suresh, and Sujit, P. B.
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Robotics - Abstract
Autonomous Vehicle (AV) decision making in urban environments is inherently challenging due to the dynamic interactions with surrounding vehicles. For safe planning, AV must understand the weightage of various spatiotemporal interactions in a scene. Contemporary works use colossal transformer architectures to encode interactions mainly for trajectory prediction, resulting in increased computational complexity. To address this issue without compromising spatiotemporal understanding and performance, we propose the simple Deep Attention Driven Reinforcement Learning (DADRL) framework, which dynamically assigns and incorporates the significance of surrounding vehicles into the ego's RL driven decision making process. We introduce an AV centric spatiotemporal attention encoding (STAE) mechanism for learning the dynamic interactions with different surrounding vehicles. To understand map and route context, we employ a context encoder to extract features from context maps. The spatiotemporal representations combined with contextual encoding provide a comprehensive state representation. The resulting model is trained using the Soft Actor Critic (SAC) algorithm. We evaluate the proposed framework on the SMARTS urban benchmarking scenarios without traffic signals to demonstrate that DADRL outperforms recent state of the art methods. Furthermore, an ablation study underscores the importance of the context-encoder and spatio temporal attention encoder in achieving superior performance., Comment: 6 pages, 3 figures
- Published
- 2024
7. Operationalizing the Blueprint for an AI Bill of Rights: Recommendations for Practitioners, Researchers, and Policy Makers
- Author
-
Oesterling, Alex, Bhalla, Usha, Venkatasubramanian, Suresh, and Lakkaraju, Himabindu
- Subjects
Computer Science - Artificial Intelligence ,Computer Science - Computers and Society ,Computer Science - Machine Learning - Abstract
As Artificial Intelligence (AI) tools are increasingly employed in diverse real-world applications, there has been significant interest in regulating these tools. To this end, several regulatory frameworks have been introduced by different countries worldwide. For example, the European Union recently passed the AI Act, the White House issued an Executive Order on safe, secure, and trustworthy AI, and the White House Office of Science and Technology Policy issued the Blueprint for an AI Bill of Rights (AI BoR). Many of these frameworks emphasize the need for auditing and improving the trustworthiness of AI tools, underscoring the importance of safety, privacy, explainability, fairness, and human fallback options. Although these regulatory frameworks highlight the necessity of enforcement, practitioners often lack detailed guidance on implementing them. Furthermore, the extensive research on operationalizing each of these aspects is frequently buried in technical papers that are difficult for practitioners to parse. In this write-up, we address this shortcoming by providing an accessible overview of existing literature related to operationalizing regulatory principles. We provide easy-to-understand summaries of state-of-the-art literature and highlight various gaps that exist between regulatory guidelines and existing AI research, including the trade-offs that emerge during operationalization. We hope that this work not only serves as a starting point for practitioners interested in learning more about operationalizing the regulatory guidelines outlined in the Blueprint for an AI BoR but also provides researchers with a list of critical open problems and gaps between regulations and state-of-the-art AI research. Finally, we note that this is a working paper and we invite feedback in line with the purpose of this document as described in the introduction., Comment: 15 pages
- Published
- 2024
8. Probabilistic learning rate scheduler with provable convergence
- Author
-
Devapriya, Dahlia, Tholeti, Thulasi, Suresh, Janani, and Kalyani, Sheetal
- Subjects
Computer Science - Machine Learning - Abstract
Learning rate schedulers have shown great success in speeding up the convergence of learning algorithms in practice. However, their convergence to a minimum has not been proven theoretically. This difficulty mainly arises from the fact that, while traditional convergence analysis prescribes to monotonically decreasing (or constant) learning rates, schedulers opt for rates that often increase and decrease through the training epochs. In this work, we aim to bridge the gap by proposing a probabilistic learning rate scheduler (PLRS), that does not conform to the monotonically decreasing condition, with provable convergence guarantees. In addition to providing detailed convergence proofs, we also show experimental results where the proposed PLRS performs competitively as other state-of-the-art learning rate schedulers across a variety of datasets and architectures.
- Published
- 2024
9. Untangling the Furball: A Practice Mapping Approach to the Analysis of Multimodal Interactions in Social Networks
- Author
-
Bruns, Axel, Kasianenko, Kateryna, Suresh, Vishnu Padinjaredath, Dehghan, Ehsan, and Vodden, Laura
- Subjects
Computer Science - Social and Information Networks - Abstract
This article introduces the analytical approach of practice mapping, using vector embeddings of network actions and interactions to map commonalities and disjunctures in the practices of social media users, as a framework for methodological advancement beyond the limitations of conventional network analysis and visualisation. In particular, the methodological framework we outline here has the potential to incorporate multiple distinct modes of interaction into a single practice map, can be further enriched with account-level attributes such as information gleaned from textual analysis, profile information, available demographic details, and other features, and can be applied even to a cross-platform analysis of communicative patterns and practices. The article presents practice mapping as an analytical framework and outlines its key methodological considerations. Given its prominence in past social media research, we draw on examples and data from the platform formerly known as Twitter in order to enable experienced scholars to translate their approaches to a practice mapping paradigm more easily, but point out how data from other platforms may be used in equivalent ways in practice mapping studies. We illustrate the utility of the approach by applying it to a dataset where the application of conventional network analysis and visualisation approaches has produced few meaningful insights.
- Published
- 2024
10. An Upper Limit on the Photoproduction Cross Section of the Spin-Exotic $\pi_1(1600)$
- Author
-
Afzal, F., Akondi, C. S., Albrecht, M., Amaryan, M., Arrigo, S., Arroyave, V., Asaturyan, A., Austregesilo, A., Baldwin, Z., Barbosa, F., Barlow, J., Barriga, E., Barsotti, R., Barton, D., Baturin, V., Berdnikov, V. V., Black, T., Boeglin, W., Boer, M., Briscoe, W. J., Britton, T., Cao, S., Chudakov, E., Chung, G., Cole, P. L., Cortes, O., Crede, V., Dalton, M. M., Darulis, D., Deur, A., Dobbs, S., Dolgolenko, A., Dugger, M., Dzhygadlo, R., Ebersole, D., Edo, M., Egiyan, H., Erbora, T., Eugenio, P., Fabrizi, A., Fanelli, C., Fang, S., Fitches, J., Foda, A. M., Furletov, S., Gan, L., Gao, H., Gardner, A., Gasparian, A., Glazier, D., Gleason, C., Goryachev, V. S., Grube, B., Guo, J., Guo, L., Hernandez, J., Hernandez, K., Hoffman, N. D., Hornidge, D., Hou, G., Hurck, P., Hurley, A., Imoehl, W., Ireland, D. G., Ito, M. M., Jaegle, I., Jarvis, N. S., Jeske, T., Jing, M., Jones, R. T., Kakoyan, V., Kalicy, G., Khachatryan, V., Kourkoumelis, C., LaDuke, A., Larin, I., Lawrence, D., Lersch, D. I., Li, H., Liu, B., Livingston, K., Lolos, G. J., Lorenti, L., Lyubovitskij, V., Ma, R., Mack, D., Mahmood, A., Marukyan, H., Matveev, V., McCaughan, M., McCracken, M., Meyer, C. A., Miskimen, R., Mitchell, R. E., Mizutani, K., Neelamana, V., Ng, L., Nissen, E., Orešić, S., Ostrovidov, A. I., Papandreou, Z., Paudel, C., Pedroni, R., Pentchev, L., Peters, K. J., Prather, E., Rakshit, S., Reinhold, J., Remington, A., Ritchie, B. G., Ritman, J., Rodriguez, G., Romanov, D., Saldana, K., Salgado, C., Schadmand, S., Schertz, A. M., Scheuer, K., Schick, A., Schmidt, A., Schumacher, R. A., Schwiening, J., Septian, N., Sharp, P., Shen, X., Shepherd, M. R., Sikes, J., Smith, A., Smith, E. S., Sober, D. I., Somov, A., Somov, S., Stevens, J. R., Strakovsky, I. I., Sumner, B., Suresh, K., Tarasov, V. V., Taylor, S., Teymurazyan, A., Thiel, A., Viducic, T., Whitlatch, T., Wickramaarachchi, N., Wunderlich, Y., Yu, B., Zarling, J., Zhang, Z., Zhou, X., and Zihlmann, B.
- Subjects
Nuclear Experiment ,High Energy Physics - Experiment - Abstract
The spin-exotic hybrid meson $\pi_{1}(1600)$ is predicted to have a large decay rate to the $\omega\pi\pi$ final state. Using 76.6~pb$^{-1}$ of data collected with the GlueX detector, we measure the cross sections for the reactions $\gamma p \to \omega \pi^+ \pi^- p$, $\gamma p \to \omega \pi^0 \pi^0 p$, and $\gamma p\to\omega\pi^-\pi^0\Delta^{++}$ in the range $E_\gamma =$ 8-10 GeV. Using isospin conservation, we set the first upper limits on the photoproduction cross sections of the $\pi^{0}_{1}(1600)$ and $\pi^{-}_{1}(1600)$. We combine these limits with lattice calculations of decay widths and find that photoproduction of $\eta'\pi$ is the most sensitive two-body system to search for the $\pi_1(1600)$., Comment: 6 pages, 3 figures plus supplemental materials
- Published
- 2024
11. Local-global principle for groups of type An over semi global fields
- Author
-
Suresh, V.
- Subjects
Mathematics - Algebraic Geometry ,Mathematics - Number Theory ,11E72, 12G05, 14G05 (primary) 14H25, 20G15, 14G27 (secondary) - Abstract
Let F be the function field of a curve over a complete discretely valued field K. Let G be a semisimple simply connected linear algebraic group over F of type An. We give a description of the obstruction to local global principle for principal homogeneous spaces under G over F with respect to discrete valuations of F in terms of R-equivalence classes of G over some suitable over fields. Using this description we prove that this obstruction vanishes under some conditions on the residue field K., Comment: 32 pages
- Published
- 2024
12. $\Lambda_{\rm s}$CDM cosmology: Alleviating major cosmological tensions by predicting standard neutrino properties
- Author
-
Yadav, Anita, Kumar, Suresh, Kibris, Cihad, and Akarsu, Ozgur
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics ,General Relativity and Quantum Cosmology ,High Energy Physics - Phenomenology ,High Energy Physics - Theory - Abstract
We investigate a two-parameter extension of the $\Lambda_{\rm s}$CDM model by allowing variations in the effective number of neutrino species $N_{\rm eff}$ and their total mass $\sum m_\nu$. Our motivation is twofold: (i) to examine whether $\Lambda_{\rm s}$CDM retains its success in fitting the data and addressing major cosmological tensions, without suggesting a need for a deviation from the standard model of particle physics, and (ii) to determine whether the data indicate new physics that could potentially address cosmological tensions, either in the post-recombination universe through the late-time mirror AdS-dS transition, or in the pre-recombination universe through modifications in the standard values of $N_{\rm eff}$ and $\sum m_\nu$, or both. Within the extended $\Lambda_{\rm s}$CDM model, referred to as $\Lambda_{\rm s}$CDM+$N_{\rm eff}$+$\sum m_{\rm \nu}$, we find no significant tension when considering the Planck-alone analysis. We observe that incorporating BAO data limits the further success of the $\Lambda_{\rm s}$CDM extension. However, the weakly model-dependent BAOtr data, along with Planck and Planck+PP\&SH0ES, favor $H_0\sim 73\,{\rm km\, s^{-1}\, Mpc^{-1}}$. In cases where BAOtr dataset is used, the mirror AdS-dS transition is very effective in providing enhanced $H_0$ values, and thus the model requires no significant deviation from the standard value of $N_{\rm eff} = 3.044$. Both the $H_0$ and $S_8$ tensions are effectively addressed, with some compromise in the case of the Planck+BAO dataset. Finally, the upper bounds obtained on $\sum m_\nu \lesssim 0.5$~eV are fully compatible with neutrino oscillation experiments. Our findings provide evidence that late-time physics beyond $\Lambda$CDM, such as $\Lambda_{\rm s}$CDM, without altering the standard pre-recombination universe, can suffice to alleviate the major cosmological tensions., Comment: 20 pages, 6 figures, 3 tables
- Published
- 2024
13. An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks
- Author
-
Suresh, Varsha, Aït-Mokhtar, Salah, Brun, Caroline, and Calapodescu, Ioan
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
Self-supervised learning models have revolutionized the field of speech processing. However, the process of fine-tuning these models on downstream tasks requires substantial computational resources, particularly when dealing with multiple speech-processing tasks. In this paper, we explore the potential of adapter-based fine-tuning in developing a unified model capable of effectively handling multiple spoken language processing tasks. The tasks we investigate are Automatic Speech Recognition, Phoneme Recognition, Intent Classification, Slot Filling, and Spoken Emotion Recognition. We validate our approach through a series of experiments on the SUPERB benchmark, and our results indicate that adapter-based fine-tuning enables a single encoder-decoder model to perform multiple speech processing tasks with an average improvement of 18.4% across the five target tasks while staying efficient in terms of parameter updates., Comment: ICASSP 2024
- Published
- 2024
- Full Text
- View/download PDF
14. Measurement of Spin-Density Matrix Elements in $\Delta^{++}(1232)$ photoproduction
- Author
-
Afzal, F., Akondi, C. S., Albrecht, M., Amaryan, M., Arrigo, S., Arroyave, V., Asaturyan, A., Austregesilo, A., Baldwin, Z., Barbosa, F., Barlow, J., Barriga, E., Barsotti, R., Barton, D., Baturin, V., Berdnikov, V. V., Black, T., Boeglin, W., Boer, M., Briscoe, W. J., Britton, T., Cao, S., Chudakov, E., Chung, G., Cole, P. L., Cortes, O., Crede, V., Dalton, M. M., Darulis, D., Deur, A., Dobbs, S., Dolgolenko, A., Dugger, M., Dzhygadlo, R., Ebersole, D., Edo, M., Egiyan, H., Erbora, T., Eugenio, P., Fabrizi, A., Fanelli, C., Fang, S., Fitches, J., Foda, A. M., Furletov, S., Gan, L., Gao, H., Gardner, A., Gasparian, A., Glazier, D., Gleason, C., Goryachev, V. S., Grube, B., Guo, J., Guo, L., Hernandez, J., Hernandez, K., Hoffman, N. D., Hornidge, D., Hou, G., Hurck, P., Hurley, A., Imoehl, W., Ireland, D. G., Ito, M. M., Jaegle, I., Jarvis, N. S., Jeske, T., Jing, M., Jones, R. T., Kakoyan, V., Kalicy, G., Khachatryan, V., Kourkoumelis, C., LaDuke, A., Larin, I., Lawrence, D., Lersch, D. I., Li, H., Liu, B., Livingston, K., Lolos, G. J., Lorenti, L., Lyubovitskij, V., Ma, R., Mack, D., Mahmood, A., Marukyan, H., Matveev, V., McCaughan, M., McCracken, M., Meyer, C. A., Miskimen, R., Mitchell, R. E., Mizutani, K., Neelamana, V., Ng, L., Nissen, E., Orešić, S., Ostrovidov, A. I., Papandreou, Z., Paudel, C., Pedroni, R., Pentchev, L., Peters, K. J., Prather, E., Rakshit, S., Reinhold, J., Remington, A., Ritchie, B. G., Ritman, J., Rodriguez, G., Romanov, D., Saldana, K., Salgado, C., Schadmand, S., Schertz, A. M., Scheuer, K., Schick, A., Schmidt, A., Schumacher, R. A., Schwiening, J., Septian, N., Sharp, P., Shen, X., Shepherd, M. R., Sikes, J., Smith, A., Smith, E. S., Sober, D. I., Somov, A., Somov, S., Stevens, J. R., Strakovsky, I. I., Sumner, B., Suresh, K., Tarasov, V. V., Taylor, S., Teymurazyan, A., Thiel, A., Viducic, T., Whitlatch, T., Wickramaarachchi, N., Wunderlich, Y., Yu, B., Zarling, J., Zhang, Z., Zhou, X., and Zihlmann, B.
- Subjects
Nuclear Experiment - Abstract
We report the measurement of spin-density matrix elements of the $\Delta^{++}(1232)$ in the photoproduction reaction $\gamma p \to \pi^-\Delta^{++}(1232)$ with the GlueX experiment in Hall D at Jefferson Lab. The measurement used a linearly polarized photon beam with $E_\gamma=8.2-8.8$~GeV and the statistical precision exceeds the previous measurement from SLAC by three orders of magnitude for the momentum transfer squared region $-t < 1.4$ GeV$^2$. The data are sensitive to the previously undetermined relative sign between couplings in existing Regge exchange models. Linear combinations of the extracted SDMEs allow for a decomposition into natural and unnatural exchange amplitudes, which shows that the unnatural exchange plays an important role in the low $-t$ region.
- Published
- 2024
15. Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning
- Author
-
Zhang, Jifan, Jain, Lalit, Guo, Yang, Chen, Jiayi, Zhou, Kuan Lok, Suresh, Siddharth, Wagenmaker, Andrew, Sievert, Scott, Rogers, Timothy, Jamieson, Kevin, Mankoff, Robert, and Nowak, Robert
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence ,Computer Science - Computation and Language - Abstract
We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years. This unique dataset supports the development and evaluation of multimodal large language models and preference-based fine-tuning algorithms for humorous caption generation. We propose novel benchmarks for judging the quality of model-generated captions, utilizing both GPT4 and human judgments to establish ranking-based evaluation strategies. Our experimental results highlight the limitations of current fine-tuning methods, such as RLHF and DPO, when applied to creative tasks. Furthermore, we demonstrate that even state-of-the-art models like GPT4 and Claude currently underperform top human contestants in generating humorous captions. As we conclude this extensive data collection effort, we release the entire preference dataset to the research community, fostering further advancements in AI humor generation and evaluation.
- Published
- 2024
16. Cosmological constraints on $\Lambda_{\rm s}$CDM scenario in a type II minimally modified gravity
- Author
-
Akarsu, Ozgur, De Felice, Antonio, Di Valentino, Eleonora, Kumar, Suresh, Nunes, Rafael C., Ozulker, Emre, Vazquez, J. Alberto, and Yadav, Anita
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics ,General Relativity and Quantum Cosmology ,High Energy Physics - Phenomenology ,High Energy Physics - Theory - Abstract
The idea of a rapid sign-switching cosmological constant (mirror AdS-dS transition) in the late universe at $z\sim1.7$, known as the $\Lambda_{\rm s}$CDM model, has significantly improved the fit to observational data and provides a promising scenario for alleviating major cosmological tensions, such as the $H_0$ and $S_8$ tensions. However, in the absence of a fully predictive model, implementing this fit required conjecturing that the dynamics of the linear perturbations are governed by general relativity. Recent work embedding the $\Lambda_{\rm s}$CDM model with the Lagrangian of a type-II minimally modified gravity known as VCDM has propelled $\Lambda_{\rm s}$CDM to a fully predictive model, removing the uncertainty related to the aforementioned assumption; we call this new model $\Lambda_{\rm s}$VCDM. In this work, we demonstrate that not only does $\Lambda_{\rm s}$CDM fit the data better than the standard $\Lambda$CDM model, but the new model, $\Lambda_{\rm s}$VCDM, performs even better in alleviating cosmological tensions while also providing a better fit to the data, including CMB, BAO, SNe Ia, and Cosmic Shear measurements. Our findings highlight the $\Lambda_{\rm s}$CDM framework, particularly the $\Lambda_{\rm s}$VCDM model, as a compelling alternative to the standard $\Lambda$CDM model, especially by successfully alleviating the $H_0$ tension. Additionally, these models predict higher values for $\sigma_8$, indicating enhanced structuring, albeit with lower present-day matter density parameter values and consequently reduced $S_8$ values, alleviating the $S_8$ tension as well. This demonstrates that the data are well fit by a combination of background and linear perturbations, both having dynamics differing from those of $\Lambda$CDM. This paves the way for further exploration of new ways for embedding the sign-switching cosmological constant into other models., Comment: 15 pages, 5 figures, 2 tables
- Published
- 2024
17. Component Matching Approach in Linking Business and Application Architecture
- Author
-
Kamath, Suresh
- Subjects
Computer Science - Software Engineering - Abstract
The development of an IT strategy and ensuring that it is the best possible one for business is a key problem many organizations face. This problem is that of linking business architecture to IT architecture in general and application architecture specifically. In our earlier work we proposed Category theory as the formal language to unify the business and IT worlds with the ability to represent the concepts and relations between the two in a unified way. We used rCOS as the underlying model for the specification of interfaces, contracts, and components. The concept of pseudo-category was then utilized to represent the business and application architecture specifications and the relationships contained within. The linkages between them now can be established using the matching of the business component contracts with the application component contracts. However the matching was based on manual process and in this paper we extend the work by considering automated component matching process. The ground work for a tool to support the matching process is laid out in this paper., Comment: 8 pages, one figure
- Published
- 2024
18. Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
- Author
-
Salman, Ali N., Du, Zongyang, Chandra, Shreeram Suresh, Ulgen, Ismail Rasim, Busso, Carlos, and Sisman, Berrak
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Voice conversion (VC) research traditionally depends on scripted or acted speech, which lacks the natural spontaneity of real-life conversations. While natural speech data is limited for VC, our study focuses on filling in this gap. We introduce a novel data-sourcing pipeline that makes the release of a natural speech dataset for VC, named NaturalVoices. The pipeline extracts rich information in speech such as emotion and signal-to-noise ratio (SNR) from raw podcast data, utilizing recent deep learning methods and providing flexibility and ease of use. NaturalVoices marks a large-scale, spontaneous, expressive, and emotional speech dataset, comprising over 3,800 hours speech sourced from the original podcasts in the MSP-Podcast dataset. Objective and subjective evaluations demonstrate the effectiveness of using our pipeline for providing natural and expressive data for VC, suggesting the potential of NaturalVoices for broader speech generation tasks.
- Published
- 2024
19. Style Mixture of Experts for Expressive Text-To-Speech Synthesis
- Author
-
Jawaid, Ahad, Chandra, Shreeram Suresh, Lu, Junchen, and Sisman, Berrak
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Computation and Language ,Computer Science - Machine Learning ,Computer Science - Sound - Abstract
Recent advances in style transfer text-to-speech (TTS) have improved the expressiveness of synthesized speech. Despite these advancements, encoding stylistic information from diverse and unseen reference speech remains challenging. This paper introduces StyleMoE, an approach that divides the embedding space, modeled by the style encoder, into tractable subsets handled by style experts. The proposed method replaces the style encoder in a TTS system with a Mixture of Experts (MoE) layer. By utilizing a gating network to route reference speeches to different style experts, each expert specializes in aspects of the style space during optimization. Our experiments objectively and subjectively demonstrate the effectiveness of our proposed method in increasing the coverage of the style space for diverse and unseen styles. This approach can enhance the performance of existing state-of-the-art style transfer TTS models, marking the first study of MoE in style transfer TTS to our knowledge.
- Published
- 2024
20. MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models
- Author
-
Huang, Mingzhen, Cai, Jialing, Jia, Shan, Lokhande, Vishnu Suresh, and Lyu, Siwei
- Subjects
Computer Science - Computer Vision and Pattern Recognition - Abstract
Text-driven image synthesis has made significant advancements with the development of diffusion models, transforming how visual content is generated from text prompts. Despite these advances, text-driven image editing, a key area in computer graphics, faces unique challenges. A major challenge is making simultaneous edits across multiple objects or attributes. Applying these methods sequentially for multi-aspect edits increases computational demands and efficiency losses. In this paper, we address these challenges with significant contributions. Our main contribution is the development of MultiEdits, a method that seamlessly manages simultaneous edits across multiple attributes. In contrast to previous approaches, MultiEdits not only preserves the quality of single attribute edits but also significantly improves the performance of multitasking edits. This is achieved through an innovative attention distribution mechanism and a multi-branch design that operates across several processing heads. Additionally, we introduce the PIE-Bench++ dataset, an expansion of the original PIE-Bench dataset, to better support evaluating image-editing tasks involving multiple objects and attributes simultaneously. This dataset is a benchmark for evaluating text-driven image editing methods in multifaceted scenarios. Dataset and code are available at https://mingzhenhuang.com/projects/MultiEdits.html.
- Published
- 2024
21. Learning from metastable grain boundaries
- Author
-
Mishra, Avanish, Suresh, Sumit A., Fensin, Saryu J., Mathew, Nithin, and Kober, Edward M.
- Subjects
Condensed Matter - Materials Science - Abstract
Grain boundaries (GBs) govern critical properties of polycrystals. Although significant advancements have been made in characterizing minimum energy GBs, real GBs are seldom found in such states, making it challenging to establish structure-property relationships. This diversity of atomic arrangements in metastable states motivates using data-driven methods to establish these relationships. In this study, we utilize a vast atomistic database (~5000) of minimum energy and metastable states of symmetric tilt copper GBs, combined with physically-motivated local atomic environment (LAE) descriptors (Strain Functional Descriptors, SFDs) to predict GB properties. Our regression models exhibit robust predictive capabilities using only 19 descriptors, generalizing to atomic environments in nanocrystals. A significant highlight of our work is integration of an unsupervised method with SFDs to elucidate LAEs at GBs and their role in determining properties. Our research underscores the role of a physics-based representation of LAEs and efficacy of data-driven methods in establishing GB structure-property relationships.
- Published
- 2024
22. ShelfHelp: Empowering Humans to Perform Vision-Independent Manipulation Tasks with a Socially Assistive Robotic Cane
- Author
-
Agrawal, Shivendra, Nayak, Suresh, Naik, Ashutosh, and Hayes, Bradley
- Subjects
Computer Science - Robotics ,Computer Science - Artificial Intelligence ,Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Human-Computer Interaction ,Computer Science - Machine Learning - Abstract
The ability to shop independently, especially in grocery stores, is important for maintaining a high quality of life. This can be particularly challenging for people with visual impairments (PVI). Stores carry thousands of products, with approximately 30,000 new products introduced each year in the US market alone, presenting a challenge even for modern computer vision solutions. Through this work, we present a proof-of-concept socially assistive robotic system we call ShelfHelp, and propose novel technical solutions for enhancing instrumented canes traditionally meant for navigation tasks with additional capability within the domain of shopping. ShelfHelp includes a novel visual product locator algorithm designed for use in grocery stores and a novel planner that autonomously issues verbal manipulation guidance commands to guide the user during product retrieval. Through a human subjects study, we show the system's success in locating and providing effective manipulation guidance to retrieve desired products with novice users. We compare two autonomous verbal guidance modes achieving comparable performance to a human assistance baseline and present encouraging findings that validate our system's efficiency and effectiveness and through positive subjective metrics including competence, intelligence, and ease of use., Comment: 8 pages, 14 figures and charts
- Published
- 2024
- Full Text
- View/download PDF
23. Participation in the age of foundation models
- Author
-
Suresh, Harini, Tseng, Emily, Young, Meg, Gray, Mary L., Pierson, Emma, and Levy, Karen
- Subjects
Computer Science - Computers and Society ,Computer Science - Artificial Intelligence ,Computer Science - Human-Computer Interaction ,Computer Science - Machine Learning - Abstract
Growing interest and investment in the capabilities of foundation models has positioned such systems to impact a wide array of public services. Alongside these opportunities is the risk that these systems reify existing power imbalances and cause disproportionate harm to marginalized communities. Participatory approaches hold promise to instead lend agency and decision-making power to marginalized stakeholders. But existing approaches in participatory AI/ML are typically deeply grounded in context - how do we apply these approaches to foundation models, which are, by design, disconnected from context? Our paper interrogates this question. First, we examine existing attempts at incorporating participation into foundation models. We highlight the tension between participation and scale, demonstrating that it is intractable for impacted communities to meaningfully shape a foundation model that is intended to be universally applicable. In response, we develop a blueprint for participatory foundation models that identifies more local, application-oriented opportunities for meaningful participation. In addition to the "foundation" layer, our framework proposes the "subfloor'' layer, in which stakeholders develop shared technical infrastructure, norms and governance for a grounded domain, and the "surface'' layer, in which affected communities shape the use of a foundation model for a specific downstream task. The intermediate "subfloor'' layer scopes the range of potential harms to consider, and affords communities more concrete avenues for deliberation and intervention. At the same time, it avoids duplicative effort by scaling input across relevant use cases. Through three case studies in clinical care, financial services, and journalism, we illustrate how this multi-layer model can create more meaningful opportunities for participation than solely intervening at the foundation layer., Comment: 13 pages, 2 figures. Appeared at FAccT '24
- Published
- 2024
- Full Text
- View/download PDF
24. Advancing Household Robotics: Deep Interactive Reinforcement Learning for Efficient Training and Enhanced Performance
- Author
-
Soni, Arpita, Alla, Sujatha, Dodda, Suresh, and Volikatla, Hemanth
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning - Abstract
The market for domestic robots made to perform household chores is growing as these robots relieve people of everyday responsibilities. Domestic robots are generally welcomed for their role in easing human labor, in contrast to industrial robots, which are frequently criticized for displacing human workers. But before these robots can carry out domestic chores, they need to become proficient in several minor activities, such as recognizing their surroundings, making decisions, and picking up on human behaviors. Reinforcement learning, or RL, has emerged as a key robotics technology that enables robots to interact with their environment and learn how to optimize their actions to maximize rewards. However, the goal of Deep Reinforcement Learning is to address more complicated, continuous action-state spaces in real-world settings by combining RL with Neural Networks. The efficacy of DeepRL can be further augmented through interactive feedback, in which a trainer offers real-time guidance to expedite the robot's learning process. Nevertheless, the current methods have drawbacks, namely the transient application of guidance that results in repeated learning under identical conditions. Therefore, we present a novel method to preserve and reuse information and advice via Deep Interactive Reinforcement Learning, which utilizes a persistent rule-based system. This method not only expedites the training process but also lessens the number of repetitions that instructors will have to carry out. This study has the potential to advance the development of household robots and improve their effectiveness and efficiency as learners.
- Published
- 2024
- Full Text
- View/download PDF
25. AI-Assisted Detector Design for the EIC (AID(2)E)
- Author
-
Diefenthaler, M., Fanelli, C., Gerlach, L. O., Guan, W., Horn, T., Jentsch, A., Lin, M., Nagai, K., Nayak, H., Pecar, C., Suresh, K., Vossen, A., Wang, T., and Wenaus, T.
- Subjects
Physics - Instrumentation and Detectors ,Computer Science - Artificial Intelligence - Abstract
Artificial Intelligence is poised to transform the design of complex, large-scale detectors like the ePIC at the future Electron Ion Collider. Featuring a central detector with additional detecting systems in the far forward and far backward regions, the ePIC experiment incorporates numerous design parameters and objectives, including performance, physics reach, and cost, constrained by mechanical and geometric limits. This project aims to develop a scalable, distributed AI-assisted detector design for the EIC (AID(2)E), employing state-of-the-art multiobjective optimization to tackle complex designs. Supported by the ePIC software stack and using Geant4 simulations, our approach benefits from transparent parameterization and advanced AI features. The workflow leverages the PanDA and iDDS systems, used in major experiments such as ATLAS at CERN LHC, the Rubin Observatory, and sPHENIX at RHIC, to manage the compute intensive demands of ePIC detector simulations. Tailored enhancements to the PanDA system focus on usability, scalability, automation, and monitoring. Ultimately, this project aims to establish a robust design capability, apply a distributed AI-assisted workflow to the ePIC detector, and extend its applications to the design of the second detector (Detector-2) in the EIC, as well as to calibration and alignment tasks. Additionally, we are developing advanced data science tools to efficiently navigate the complex, multidimensional trade-offs identified through this optimization process., Comment: 11 pages, 4 figures, AI4EIC 2023 proceeding
- Published
- 2024
26. Technical design report for the CODEX-$\beta$ demonstrator
- Author
-
collaboration, CODEX-b, Aielli, Giulio, Alimena, Juliette, Beacham, James, Haim, Eli Ben, Burucs, Andras, Cardarelli, Roberto, Charles, Matthew, Vidal, Xabier Cid, De Roeck, Albert, Dey, Biplab, Dobrescu, Silviu, Durmus, Ozgur, Elashri, Mohamed, Gligorov, Vladimir, Suarez, Rebeca Gonzalez, Gorordo, Thomas, Gray, Zarria, Henderson, Conor, Henry, Louis, Ilten, Philip, Johnson, Daniel, Kautz, Jacob, Knapen, Simon, Liu, Bingxuan, Liu, Yang, Solino, Saul Lopez, Mombacher, Titus, Nachman, Benjamin, Northacker, David, Nowak, Gabriel, Papucci, Michele, Pasztor, Gabriella, Rial, Eloi Pazos, Pfaller, Jake, Pizzimento, Luca, Casasus, Maximo Plo, Rassati, Gian Andrea, Robinson, Dean, Fernandez, Emilio Xose Rodriguez, Sahoo, Debashis, Simsek, Sinem, Sokoloff, Michael, Suresh, Aditya, Swallow, Paul, Swanson, James, Vari, Riccardo, Sierra, Carlos Vazquez, Veres, Gabor, Watson, Nigel, Wilkinson, Michael, and Williams, Michael
- Subjects
Physics - Instrumentation and Detectors ,High Energy Physics - Experiment - Abstract
The CODEX-$\beta$ apparatus is a demonstrator for the proposed future CODEX-b experiment, a long-lived-particle detector foreseen for operation at IP8 during HL-LHC data-taking. The demonstrator project, intended to collect data in 2025, is described, with a particular focus on the design, construction, and installation of the new apparatus.
- Published
- 2024
27. Searching for gravitational wave optical counterparts with the Zwicky Transient Facility: summary of O4a
- Author
-
Ahumada, Tomás, Anand, Shreya, Coughlin, Michael W., Gupta, Vaidehi, Kasliwal, Mansi M., Karambelkar, Viraj R., Stein, Robert D., Waratkar, Gaurav, Swain, Vishwajeet, Laz, Theophile Jegou du, Anumarlapudi, Akash, Andreoni, Igor, Bulla, Mattia, Srinivasaragavan, Gokul P., Toivonen, Andrew, Wold, Avery, Bellm, Eric C., Cenko, S. Bradley, Kaplan, David L., Sollerman, Jesper, Bhalerao, Varun, Perley, Daniel, Salgundi, Anirudh, Suresh, Aswin, Hinds, K-Ryan, Reusch, Simeon, Necker, Jannis, Cook, David O., Pletskova, Natalya, Singer, Leo P., Banerjee, Smaranika, Barna, Tyler, Copperwheat, Christopher M., Healy, Brian, Kiendrebeogo, R. Weizmann, Kumar, Harsh, Kumar, Ravi, Pezzella, Marianna, Sagues-Carracedo, Ana, Sravan, Niharika, Bloom, Joshua S., Chen, Tracy X., Graham, Matthew, Helou, George, Laher, Russ R., Mahabal, Ashish A., Purdum, Josiah, Anupama, G. C., Barway, Sudhanshu, Basu, Judhajeet, Raman, Dhananjay, and Roychowdhury, Tamojeet
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
During the first half of the fourth observing run (O4a) of the International Gravitational Wave Network (IGWN), the Zwicky Transient Facility (ZTF) conducted a systematic search for kilonova (KN) counterparts to binary neutron star (BNS) and neutron star-black hole (NSBH) merger candidates. Here, we present a comprehensive study of the five high-significance (FAR < 1 per year) BNS and NSBH candidates in O4a. Our follow-up campaigns relied on both target-of-opportunity observations (ToO) and re-weighting of the nominal survey schedule to maximize coverage. We describe the toolkit we have been developing, Fritz, an instance of SkyPortal, instrumental in coordinating and managing our telescope scheduling, candidate vetting, and follow-up observations through a user-friendly interface. ZTF covered a total of 2841 deg$^2$ within the skymaps of the high-significance GW events, reaching a median depth of g~20.2 mag. We circulated 15 candidates, but found no viable KN counterpart to any of the GW events. Based on the ZTF non-detections of the high-significance events in O4a, we used a Bayesian approach, nimbus, to quantify the posterior probability of KN model parameters that are consistent with our non-detections. Our analysis favors KNe with initial absolute magnitude fainter than -16 mag. The joint posterior probability of a GW170817-like KN associated with all our O4a follow-ups was 64%. Additionally, we use a survey simulation software, simsurvey, to determine that our combined filtered efficiency to detect a GW170817-like KN is 36%, when considering the 5 confirmed astrophysical events in O3 (1 BNS and 4 NSBH), along with our O4a follow-ups. Following Kasliwal et al. (2020), we derived joint constraints on the underlying KN luminosity function based on our O3 and O4a follow-ups, determining that no more than 76% of KNe fading at 1 mag/day can peak at a magnitude brighter than -17.5 mag., Comment: submitted
- Published
- 2024
28. DSAM: A Deep Learning Framework for Analyzing Temporal and Spatial Dynamics in Brain Networks
- Author
-
Thapaliya, Bishal, Miller, Robyn, Chen, Jiayu, Wang, Yu-Ping, Akbas, Esra, Sapkota, Ram, Ray, Bhaskar, Suresh, Pranav, Ghimire, Santosh, Calhoun, Vince, and Liu, Jingyu
- Subjects
Quantitative Biology - Neurons and Cognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
Resting-state functional magnetic resonance imaging (rs-fMRI) is a noninvasive technique pivotal for understanding human neural mechanisms of intricate cognitive processes. Most rs-fMRI studies compute a single static functional connectivity matrix across brain regions of interest, or dynamic functional connectivity matrices with a sliding window approach. These approaches are at risk of oversimplifying brain dynamics and lack proper consideration of the goal at hand. While deep learning has gained substantial popularity for modeling complex relational data, its application to uncovering the spatiotemporal dynamics of the brain is still limited. We propose a novel interpretable deep learning framework that learns goal-specific functional connectivity matrix directly from time series and employs a specialized graph neural network for the final classification. Our model, DSAM, leverages temporal causal convolutional networks to capture the temporal dynamics in both low- and high-level feature representations, a temporal attention unit to identify important time points, a self-attention unit to construct the goal-specific connectivity matrix, and a novel variant of graph neural network to capture the spatial dynamics for downstream classification. To validate our approach, we conducted experiments on the Human Connectome Project dataset with 1075 samples to build and interpret the model for the classification of sex group, and the Adolescent Brain Cognitive Development Dataset with 8520 samples for independent testing. Compared our proposed framework with other state-of-art models, results suggested this novel approach goes beyond the assumption of a fixed connectivity matrix and provides evidence of goal-specific brain connectivity patterns, which opens up the potential to gain deeper insights into how the human brain adapts its functional connectivity specific to the task at hand., Comment: 18 Pages, 4 figures
- Published
- 2024
29. Dust-ion-acoustic damped solitary waves and shocks in laboratory and Saturn's E-ring magnetized nonthermal dusty plasmas with anisotropic ion pressure and dust-charge fluctuation
- Author
-
Acharya, Num Prasad, Basnet, Suresh, Misra, Amar P., and Khanal, Raju
- Subjects
Physics - Plasma Physics - Abstract
We study the oblique propagation of weakly nonlinear dust-ion-acoustic (DIA) solitary waves (SWs) and shocks in collisional magnetized nonthermal dusty plasmas that are relevant in laboratory and space (Saturn's E-ring) environments. We consider plasmas to be composed of $q$-nonextensive hot electrons, thermal positive ions, and immobile negatively charged dust grains immersed in a static magnetic field and take into account the effects of ion creation (source), and ion loss (sink), ion-neutral and ion-dust collisions, anisotropic ion pressure and dust-charge fluctuations on the evolution of small-amplitude SWs and shocks. The ion-neutral collision enhancement equilibrium dust-charge number is self-consistently determined using Newton's Raphson method. We found that in laboratory dusty plasmas with adiabatic dust-charge variation [i.e., when the dust charging frequency ($\nu_{\rm{ch}}$) is much higher than the dust-plasma oscillation frequency ($\omega_{\rm{pd}}$)], the DIA solitary waves (DIASWs) get damped by the effects of the ion-dust and ion-neutral collisions, whereas the ion creation and ion loss leads to the amplification of solitary waves, and they appear as only compressive types with positive potential. On the other hand, in Saturn's E-ring plasmas, where the collisional and ion creation or ion loss effects are insignificant, the non-adiabaticity of dust-charge variation can give rise to the evolution of either damped DIASWs or DIA shocks, depending on the smallness of the ratios $\nu_{\rm{ch}}/\omega_{\rm{pd}}$ or $\omega_{\rm{pd}}/\nu_{\rm{ch}}$, respectively. Furthermore, two critical values of the nonextensive parameter $q$ exist, below (or above) which, the DIASWs and shocks can appear as rarefactive (or compressive) types. The characteristics of DIASWs and shocks are also analyzed numerically for parameters relevant to the laboratory and Saturn's E-ring plasmas., Comment: Total 19 pages and 21 Figures with some Figure number has subplots
- Published
- 2024
30. Exploring speech style spaces with language models: Emotional TTS without emotion labels
- Author
-
Chandra, Shreeram Suresh, Du, Zongyang, and Sisman, Berrak
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Machine Learning - Abstract
Many frameworks for emotional text-to-speech (E-TTS) rely on human-annotated emotion labels that are often inaccurate and difficult to obtain. Learning emotional prosody implicitly presents a tough challenge due to the subjective nature of emotions. In this study, we propose a novel approach that leverages text awareness to acquire emotional styles without the need for explicit emotion labels or text prompts. We present TEMOTTS, a two-stage framework for E-TTS that is trained without emotion labels and is capable of inference without auxiliary inputs. Our proposed method performs knowledge transfer between the linguistic space learned by BERT and the emotional style space constructed by global style tokens. Our experimental results demonstrate the effectiveness of our proposed framework, showcasing improvements in emotional accuracy and naturalness. This is one of the first studies to leverage the emotional correlation between spoken content and expressive delivery for emotional TTS., Comment: Accepted at Speaker Odyssey 2024
- Published
- 2024
31. Unveiling the Direct Piezoelectric Effect on Piezo-phototronic Coupling in Ferroelectrics: First Principle Study Assisted Experimental Approach
- Author
-
Samantaray, Koyal Suman, Kumar, Sourabh, Maneesha, P, Sasmal, Dilip, Baral, Suresh Chandra, Krupa, B. R. Vaishnavi, Dasgupta, Arup, Harrabi, K, Mekki, A, and Sen, Somaditya
- Subjects
Condensed Matter - Materials Science - Abstract
A new study explores the distinct roles of spontaneous polarization and piezoelectric polarization in piezo-phototronic coupling. This investigation focuses on differences in photocatalytic and piezo-photocatalytic performance using sodium bismuth titanate (NBT), a key ferroelectric material. The research aims to identify which type of polarization has a greater influence on piezo-phototronic effects. A theoretical assessment complements the experimental findings, providing additional insights. This study explores the enhanced piezo-phototronic performance of electrospun nanofibers compared to sol-gel particles under different illumination conditions (11W UV, 250W UV, and natural sunlight). Electrospun nanofibers exhibited a rate constant (k) improvement of 2.5 to 3.75 times, whereas sol-gel particles showed only 1.3 to 1.4 times higher performance when ultrasonication was added to photocatalysis. Analysis using first-principle methods revealed that nanofibers had an elastic modulus (C33) about 2.15 times lower than sol-gel particles, indicating greater flexibility. The elongation of lattice along z-axis in the case of nanofibers reduced the covalency in the Bi-O and Ti-O bonds. These structural differences led to reduced spontaneous polarization and piezoelectric stress coefficients (e31 & e33). Despite having lower piezoelectric stress coefficients, higher flexibility in nanofibers led to a higher piezoelectric strain coefficient, 2.66 and 1.97 times greater than sol-gel particles, respectively. This improved the piezo-phototronic coupling for nanofibers.
- Published
- 2024
32. Exploring the Interplay of Interpretability and Robustness in Deep Neural Networks: A Saliency-guided Approach
- Author
-
Guesmi, Amira, Aswani, Nishant Suresh, and Shafique, Muhammad
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Cryptography and Security - Abstract
Adversarial attacks pose a significant challenge to deploying deep learning models in safety-critical applications. Maintaining model robustness while ensuring interpretability is vital for fostering trust and comprehension in these models. This study investigates the impact of Saliency-guided Training (SGT) on model robustness, a technique aimed at improving the clarity of saliency maps to deepen understanding of the model's decision-making process. Experiments were conducted on standard benchmark datasets using various deep learning architectures trained with and without SGT. Findings demonstrate that SGT enhances both model robustness and interpretability. Additionally, we propose a novel approach combining SGT with standard adversarial training to achieve even greater robustness while preserving saliency map quality. Our strategy is grounded in the assumption that preserving salient features crucial for correctly classifying adversarial examples enhances model robustness, while masking non-relevant features improves interpretability. Our technique yields significant gains, achieving a 35\% and 20\% improvement in robustness against PGD attack with noise magnitudes of $0.2$ and $0.02$ for the MNIST and CIFAR-10 datasets, respectively, while producing high-quality saliency maps.
- Published
- 2024
33. Artificial intelligence for abnormality detection in high volume neuroimaging: a systematic review and meta-analysis
- Author
-
Agarwal, Siddharth, Wood, David A., Grzeda, Mariusz, Suresh, Chandhini, Din, Munaib, Cole, James, Modat, Marc, and Booth, Thomas C
- Subjects
Electrical Engineering and Systems Science - Image and Video Processing ,Computer Science - Computer Vision and Pattern Recognition - Abstract
Purpose: Most studies evaluating artificial intelligence (AI) models that detect abnormalities in neuroimaging are either tested on unrepresentative patient cohorts or are insufficiently well-validated, leading to poor generalisability to real-world tasks. The aim was to determine the diagnostic test accuracy and summarise the evidence supporting the use of AI models performing first-line, high-volume neuroimaging tasks. Methods: Medline, Embase, Cochrane library and Web of Science were searched until September 2021 for studies that temporally or externally validated AI capable of detecting abnormalities in first-line CT or MR neuroimaging. A bivariate random-effects model was used for meta-analysis where appropriate. PROSPERO: CRD42021269563. Results: Only 16 studies were eligible for inclusion. Included studies were not compromised by unrepresentative datasets or inadequate validation methodology. Direct comparison with radiologists was available in 4/16 studies. 15/16 had a high risk of bias. Meta-analysis was only suitable for intracranial haemorrhage detection in CT imaging (10/16 studies), where AI systems had a pooled sensitivity and specificity 0.90 (95% CI 0.85 - 0.94) and 0.90 (95% CI 0.83 - 0.95) respectively. Other AI studies using CT and MRI detected target conditions other than haemorrhage (2/16), or multiple target conditions (4/16). Only 3/16 studies implemented AI in clinical pathways, either for pre-read triage or as post-read discrepancy identifiers. Conclusion: The paucity of eligible studies reflects that most abnormality detection AI studies were not adequately validated in representative clinical cohorts. The few studies describing how abnormality detection AI could impact patients and clinicians did not explore the full ramifications of clinical implementation.
- Published
- 2024
34. Depth-resolved Characterization of Meissner Screening Breakdown in Surface Treated Niobium
- Author
-
Thoeng, Edward, Asaduzzaman, Md., Kolb, Philipp, McFadden, Ryan M. L., Morris, Gerald D., Ticknor, John O., Dunsiger, Sarah R., Karner, Victoria L., Fujimoto, Derek, Junginger, Tobias, Kiefl, Robert F., MacFarlane, W. Andrew, Li, Ruohong, Saminathan, Suresh, and Laxdal, Robert E.
- Subjects
Condensed Matter - Superconductivity ,Condensed Matter - Materials Science - Abstract
We report direct measurements of the magnetic field screening at the limits of the Meissner phase for two superconducting Nb samples. The samples are processed with two different surface treatments that have been developed for superconducting radio-frequency cavity applications -- a "baseline" treatment and an oxygen-doping ("O-doping") treatment. The measurements show: 1) that the screening length is significantly longer in the "O-doping" sample compared to the "baseline" sample; 2) that the screening length near the limits of the Meissner phase increases with applied field; 3) the evolution of the screening profile as the material transitions from the Meissner phase to the mixed phase; and 4) a demonstration of the absence of any screening profile for the highest applied field, indicative of the full flux entering the sample. Measurements are performed utilizing the $\beta$-detected nuclear magnetic resonance ($\beta$-NMR) technique that allows depth resolved studies of the local magnetic field within the first 100 nm of the surface. The study takes advantage of the $\beta$-SRF beamline, a new facility at TRIUMF, Canada, where field levels up to 200 mT are available parallel to the sample surface to replicate radio frequency (RF) fields near the Meissner breakdown limits of Nb.
- Published
- 2024
35. A transversality theorem for semi-algebraic sets with application to signal recovery from the second moment and cryo-EM
- Author
-
Bendory, Tamir, Dym, Nadav, Edidin, Dan, and Suresh, Arun
- Subjects
Computer Science - Information Theory ,Electrical Engineering and Systems Science - Signal Processing ,Mathematics - Algebraic Geometry - Abstract
Semi-algebraic priors are ubiquitous in signal processing and machine learning. Prevalent examples include a) linear models where the signal lies in a low-dimensional subspace; b) sparse models where the signal can be represented by only a few coefficients under a suitable basis; and c) a large family of neural network generative models. In this paper, we prove a transversality theorem for semi-algebraic sets in orthogonal or unitary representations of groups: with a suitable dimension bound, a generic translate of any semi-algebraic set is transverse to the orbits of the group action. This, in turn, implies that if a signal lies in a low-dimensional semi-algebraic set, then it can be recovered uniquely from measurements that separate orbits. As an application, we consider the implications of the transversality theorem to the problem of recovering signals that are translated by random group actions from their second moment. As a special case, we discuss cryo-EM: a leading technology to constitute the spatial structure of biological molecules, which serves as our prime motivation. In particular, we derive explicit bounds for recovering a molecular structure from the second moment under a semi-algebraic prior and deduce information-theoretic implications. We also obtain information-theoretic bounds for three additional applications: factoring Gram matrices, multi-reference alignment, and phase retrieval. Finally, we deduce bounds for designing permutation invariant separators in machine learning.
- Published
- 2024
36. PDCCH Scheduling via Maximum Independent Set
- Author
-
Maggi, Lorenzo, Rial, Alvaro Valcarce, Herzog, Aloïs, Kalyanasundaram, Suresh, and Agrawal, Rakshak
- Subjects
Computer Science - Information Theory - Abstract
In 5G, the Physical Downlink Control CHannel (PDCCH) carries crucial information enabling the User Equipment (UE) to connect in UL and DL. UEs are unaware of the frequency location at which PDCCH is encoded, hence they need to perform blind decoding over a limited set of possible candidates. We address the problem faced by the gNodeB of selecting PDCCH candidates for each UE to optimize data transmission. We formulate it as a Maximum Weighted Independent Set (MWIS) problem, that is known to be an NP-hard problem and cannot even be approximated. A solution method called Weight-to-Degree Ratio (WDR) Greedy emerges as a strong contender for practical implementations due to its favorable performance-to-complexity trade-off and theoretical performance guarantees.
- Published
- 2024
37. Examining Changes in Internal Representations of Continual Learning Models Through Tensor Decomposition
- Author
-
Aswani, Nishant Suresh, Guesmi, Amira, Hanif, Muhammad Abdullah, and Shafique, Muhammad
- Subjects
Computer Science - Machine Learning - Abstract
Continual learning (CL) has spurred the development of several methods aimed at consolidating previous knowledge across sequential learning. Yet, the evaluations of these methods have primarily focused on the final output, such as changes in the accuracy of predicted classes, overlooking the issue of representational forgetting within the model. In this paper, we propose a novel representation-based evaluation framework for CL models. This approach involves gathering internal representations from throughout the continual learning process and formulating three-dimensional tensors. The tensors are formed by stacking representations, such as layer activations, generated from several inputs and model `snapshots', throughout the learning process. By conducting tensor component analysis (TCA), we aim to uncover meaningful patterns about how the internal representations evolve, expecting to highlight the merits or shortcomings of examined CL strategies. We conduct our analyses across different model architectures and importance-based continual learning strategies, with a curated task selection. While the results of our approach mirror the difference in performance of various CL strategies, we found that our methodology did not directly highlight specialized clusters of neurons, nor provide an immediate understanding the evolution of filters. We believe a scaled down version of our approach will provide insight into the benefits and pitfalls of using TCA to study continual learning dynamics.
- Published
- 2024
38. New bound on the Yukawa coupling from CMB
- Author
-
B, Anupama and Suresh, P K
- Subjects
Physics - General Physics - Abstract
We investigate the one loop inflation stemming from the superstring theory in the braneworld scenario. The tensor to scalar ratio of the loop inflation is found inconsistent with the recent CMB results for the Yukawa coupling from the SM sector. We propose a new bound on the Yukawa coupling, $9.92<\lambda<13.4$, applicable to the cosmological sector from CMB. The aftermath of this new bound is explored. The present results may shed some light on the phenomenology of superstring theory and its associated phenomena., Comment: 10 pages, 2 tables and 2 figuers
- Published
- 2024
39. Molecular analysis of oncogenicity associated gene 'vil8' of serotype 1 Marek's disease virus isolates from India
- Author
-
Suresh, P., Rajeswar, J. Johnson, Sukumar, K., Harikrishnan, T.J., and Srinivasan, P.
- Published
- 2020
- Full Text
- View/download PDF
40. Antibiotic Suscepyibility Pattern of Staphylococcus Aureus and Methicillin-Resistant Staphylococcus Aureus Isolated from Various Clinical Specimens in a Tertiary Care Teaching Hospital, Pondicherry
- Author
-
Krishna, P. Vamsi Muni, Reddy, V. Sreenivasulu, Kumar, V. Praveen, and Suresh, P.
- Published
- 2019
- Full Text
- View/download PDF
41. Method Development and Validation of Ezogabine by using HPTLC Method
- Author
-
Tamilselvi, N., Arivukkarasu, R., Suresh, P., Suriyan, N., Thiramilan, A., and Valarmathi, C.
- Published
- 2019
- Full Text
- View/download PDF
42. Systematic benchmarking of single-cell ATAC-sequencing protocols.
- Author
-
De Rop, Florian, Hulselmans, Gert, Flerin, Chris, Soler-Vila, Paula, Rafels, Albert, Christiaens, Valerie, González-Blas, Carmen, Marchese, Domenica, Caratù, Ginevra, Poovathingal, Suresh, Rozenblatt-Rosen, Orit, Slyper, Michael, Luo, Wendy, Muus, Christoph, Duarte, Fabiana, Shrestha, Rojesh, Bagdatli, S, Corces, M, Mamanova, Lira, Knights, Andrew, Meyer, Kerstin, Mulqueen, Ryan, Taherinasab, Akram, Maschmeyer, Patrick, Pezoldt, Jörn, Lambert, Camille, Iglesias, Marta, Najle, Sebastián, Dossani, Zain, Martelotto, Luciano, Burkett, Zach, Lebofsky, Ronald, Martin-Subero, José, Pillai, Satish, Sebé-Pedrós, Arnau, Deplancke, Bart, Teichmann, Sarah, Ludwig, Leif, Braun, Theodore, Adey, Andrew, Greenleaf, William, Buenrostro, Jason, Regev, Aviv, Aerts, Stein, and Heyn, Holger
- Subjects
Humans ,Single-Cell Analysis ,Benchmarking ,Leukocytes ,Mononuclear ,Chromatin Immunoprecipitation Sequencing ,Chromatin ,Transposases ,Sequence Analysis ,DNA ,High-Throughput Nucleotide Sequencing - Abstract
Single-cell assay for transposase-accessible chromatin by sequencing (scATAC-seq) has emerged as a powerful tool for dissecting regulatory landscapes and cellular heterogeneity. However, an exploration of systemic biases among scATAC-seq technologies has remained absent. In this study, we benchmark the performance of eight scATAC-seq methods across 47 experiments using human peripheral blood mononuclear cells (PBMCs) as a reference sample and develop PUMATAC, a universal preprocessing pipeline, to handle the various sequencing data formats. Our analyses reveal significant differences in sequencing library complexity and tagmentation specificity, which impact cell-type annotation, genotype demultiplexing, peak calling, differential region accessibility and transcription factor motif enrichment. Our findings underscore the importance of sample extraction, method selection, data processing and total cost of experiments, offering valuable guidance for future research. Finally, our data and analysis pipeline encompasses 169,000 PBMC scATAC-seq profiles and a best practices code repository for scATAC-seq data analysis, which are freely available to extend this benchmarking effort to future protocols.
- Published
- 2024
43. Global arthropod beta-diversity is spatially and temporally structured by latitude.
- Author
-
Seymour, Mathew, Roslin, Tomas, deWaard, Jeremy R, Perez, Kate HJ, D'Souza, Michelle L, Ratnasingham, Sujeevan, Ashfaq, Muhammad, Levesque-Beaudin, Valerie, Blagoev, Gergin A, Bukowski, Belén, Cale, Peter, Crosbie, Denise, Decaëns, Thibaud, deWaard, Stephanie L, Ekrem, Torbjørn, El-Ansary, Hosam O, Evouna Ondo, Fidèle, Fraser, David, Geiger, Matthias F, Hajibabaei, Mehrdad, Hallwachs, Winnie, Hanisch, Priscila E, Hausmann, Axel, Heath, Mark, Hogg, Ian D, Janzen, Daniel H, Kinnaird, Margaret, Kohn, Joshua R, Larrivée, Maxim, Lees, David C, León-Règagnon, Virginia, Liddell, Michael, Lijtmaer, Darío A, Lipinskaya, Tatsiana, Locke, Sean A, Manjunath, Ramya, Martins, Dino J, Martins, Marlúcia B, Mazumdar, Santosh, McKeown, Jaclyn TA, Anderson-Teixeria, Kristina, Miller, Scott E, Milton, Megan A, Miskie, Renee, Morinière, Jérôme, Mutanen, Marko, Naik, Suresh, Nichols, Becky, Noguera, Felipe A, Novotny, Vojtech, Penev, Lyubomir, Pentinsaari, Mikko, Quinn, Jenna, Ramsay, Leah, Rochefort, Regina, Schmidt, Stefan, Smith, M Alex, Sobel, Crystal N, Somervuo, Panu, Sones, Jayme E, Staude, Hermann S, St Jaques, Brianne, Stur, Elisabeth, Telfer, Angela C, Tubaro, Pablo L, Wardlaw, Tim J, Worcester, Robyn, Yang, Zhaofu, Young, Monica R, Zemlak, Tyler, Zakharov, Evgeny V, Zlotnick, Bradley, Ovaskainen, Otso, and Hebert, Paul DN
- Subjects
Animals ,Arthropods ,Biodiversity ,Geography ,Spatio-Temporal Analysis ,Biological Sciences ,Ecology ,Evolutionary Biology ,Life Below Water ,Biological sciences ,Biomedical and clinical sciences - Abstract
Global biodiversity gradients are generally expected to reflect greater species replacement closer to the equator. However, empirical validation of global biodiversity gradients largely relies on vertebrates, plants, and other less diverse taxa. Here we assess the temporal and spatial dynamics of global arthropod biodiversity dynamics using a beta-diversity framework. Sampling includes 129 sampling sites whereby malaise traps are deployed to monitor temporal changes in arthropod communities. Overall, we encountered more than 150,000 unique barcode index numbers (BINs) (i.e. species proxies). We assess between site differences in community diversity using beta-diversity and the partitioned components of species replacement and richness difference. Global total beta-diversity (dissimilarity) increases with decreasing latitude, greater spatial distance and greater temporal distance. Species replacement and richness difference patterns vary across biogeographic regions. Our findings support long-standing, general expectations of global biodiversity patterns. However, we also show that the underlying processes driving patterns may be regionally linked.
- Published
- 2024
44. Small noise perturbations of stochastic ergodic control problems
- Author
-
Kumar, K. Suresh and Desai, Vikrant
- Subjects
Mathematics - Optimization and Control ,Mathematics - Probability - Abstract
Using small noise limit approach, we study degenerate stochastic ergodic control problems and as a byproduct obtain error bounds for the $\varepsilon$-optimal controls. We also establish tunneling for a special ergodic control problem and give a representation of the ergodic value using the tunneled Markov chain.
- Published
- 2024
45. VoroTO: Multiscale Topology Optimization of Voronoi Structures using Surrogate Neural Networks
- Author
-
Padhy, Rahul Kumar, Suresh, Krishnan, and Chandrasekhar, Aaditya
- Subjects
Computer Science - Computational Engineering, Finance, and Science ,Mathematics - Numerical Analysis - Abstract
Cellular structures found in nature exhibit remarkable properties such as high strength, high energy absorption, excellent thermal/acoustic insulation, and fluid transfusion. Many of these structures are Voronoi-like; therefore researchers have proposed Voronoi multi-scale designs for a wide variety of engineering applications. However, designing such structures can be computationally prohibitive due to the multi-scale nature of the underlying analysis and optimization. In this work, we propose the use of a neural network (NN) to carry out efficient topology optimization (TO) of multi-scale Voronoi structures. The NN is first trained using Voronoi parameters (cell site locations, thickness, orientation, and anisotropy) to predict the homogenized constitutive properties. This network is then integrated into a conventional TO framework to minimize structural compliance subject to a volume constraint. Special considerations are given for ensuring positive definiteness of the constitutive matrix and promoting macroscale connectivity. Several numerical examples are provided to showcase the proposed method., Comment: Submitted to Engineering with Computers
- Published
- 2024
46. Classical integrability in the presence of a cosmological constant: analytic and machine learning results
- Author
-
Cardoso, Gabriel Lopes, Peña, Damián Mayorga, and Nampuri, Suresh
- Subjects
High Energy Physics - Theory ,Computer Science - Machine Learning ,Mathematical Physics - Abstract
We study the integrability of two-dimensional theories that are obtained by a dimensional reduction of certain four-dimensional gravitational theories describing the coupling of Maxwell fields and neutral scalar fields to gravity in the presence of a potential for the neutral scalar fields. By focusing on a certain solution subspace, we show that a subset of the equations of motion in two dimensions are the compatibility conditions for a modified version of the Breitenlohner-Maison linear system. Subsequently, we study the Liouville integrability of the 2D models encoding the chosen 4D solution subspace from a one-dimensional point of view by constructing Lax pair matrices. In this endeavour, we successfully employ a linear neural network to search for Lax pair matrices for these models, thereby illustrating how machine learning approaches can be effectively implemented to augment the identification of integrable structures in classical systems., Comment: 32 pages, 7 figures
- Published
- 2024
47. Understanding Robot Minds: Leveraging Machine Teaching for Transparent Human-Robot Collaboration Across Diverse Groups
- Author
-
Jayaraman, Suresh Kumaar, Simmons, Reid, Steinfeld, Aaron, and Admoni, Henny
- Subjects
Computer Science - Robotics - Abstract
In this work, we aim to improve transparency and efficacy in human-robot collaboration by developing machine teaching algorithms suitable for groups with varied learning capabilities. While previous approaches focused on tailored approaches for teaching individuals, our method teaches teams with various compositions of diverse learners using team belief representations to address personalization challenges within groups. We investigate various group teaching strategies, such as focusing on individual beliefs or the group's collective beliefs, and assess their impact on learning robot policies for different team compositions. Our findings reveal that team belief strategies yield less variation in learning duration and better accommodate diverse teams compared to individual belief strategies, suggesting their suitability in mixed-proficiency settings with limited resources. Conversely, individual belief strategies provide a more uniform knowledge level, particularly effective for homogeneously inexperienced groups. Our study indicates that the teaching strategy's efficacy is significantly influenced by team composition and learner proficiency, highlighting the importance of real-time assessment of learner proficiency and adapting teaching approaches based on learner proficiency for optimal teaching outcomes.
- Published
- 2024
48. Towards smaller, faster decoder-only transformers: Architectural variants and their implications
- Author
-
Suresh, Sathya Krishnan and P, Shunmugapriya
- Subjects
Computer Science - Machine Learning - Abstract
Research on Large Language Models (LLMs) has recently seen exponential growth, largely focused on transformer-based architectures, as introduced by [1] and further advanced by the decoder-only variations in [2]. Contemporary studies typically aim to improve model capabilities by increasing both the architecture's complexity and the volume of training data. However, research exploring how to reduce model sizes while maintaining performance is limited. This study introduces three modifications to the decoder-only transformer architecture: ParallelGPT (p-gpt), LinearlyCompressedGPT (lc-gpt), and ConvCompressedGPT (cc-gpt). These variants achieve comparable performance to conventional architectures in code generation tasks while benefiting from reduced model sizes and faster training times. We open-source the model weights and codebase to support future research and development in this domain., Comment: 8 pages, 6 figures
- Published
- 2024
49. Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository
- Author
-
Deshpande, Ajinkya, Agarwal, Anmol, Shet, Shashank, Iyer, Arun, Kanade, Aditya, Bairi, Ramakrishna, and Parthasarathy, Suresh
- Subjects
Computer Science - Software Engineering ,Computer Science - Artificial Intelligence - Abstract
LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, neglecting the intricate dependencies & interactions that characterize real-world software environments. To address this gap, we introduce RepoClassBench, a comprehensive benchmark designed to rigorously evaluate LLMs in generating complex, class-level code within real-world repositories. RepoClassBench includes "Natural Language to Class generation" tasks across Java, Python & C# from a selection of repositories. We ensure that each class in our dataset not only has cross-file dependencies within the repository but also includes corresponding test cases to verify its functionality. We find that current models struggle with the realistic challenges posed by our benchmark, primarily due to their limited exposure to relevant repository contexts. To address this shortcoming, we introduce Retrieve-Repotools-Reflect (RRR), a novel approach that equips LLMs with static analysis tools to iteratively navigate & reason about repository-level context in an agent-based framework. Our experiments demonstrate that RRR significantly outperforms existing baselines on RepoClassBench, showcasing its effectiveness across programming languages & under various settings. Our findings emphasize the critical need for code-generation benchmarks to incorporate repo-level dependencies to more accurately reflect the complexities of software development. Our work shows the benefits of leveraging specialized tools to enhance LLMs' understanding of repository context. We plan to make our dataset & evaluation harness public., Comment: Preprint with additional experiments
- Published
- 2024
50. Private federated discovery of out-of-vocabulary words for Gboard
- Author
-
Sun, Ziteng, Kairouz, Peter, Sun, Haicheng, Gascon, Adria, and Suresh, Ananda Theertha
- Subjects
Computer Science - Data Structures and Algorithms - Abstract
The vocabulary of language models in Gboard, Google's keyboard application, plays a crucial role for improving user experience. One way to improve the vocabulary is to discover frequently typed out-of-vocabulary (OOV) words on user devices. This task requires strong privacy protection due to the sensitive nature of user input data. In this report, we present a private OOV discovery algorithm for Gboard, which builds on recent advances in private federated analytics. The system offers local differential privacy (LDP) guarantees for user contributed words. With anonymous aggregation, the final released result would satisfy central differential privacy guarantees with $\varepsilon = 0.315, \delta = 10^{-10}$ for OOV discovery in en-US (English in United States).
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.