13 results on '"Prathosh, AP"'
Search Results
2. Boundary Preserving Twin Energy-Based-Models for Image to Image Translation
- Author
-
Prathosh AP, Kinjawl Bhattacharyya, and Piyush Tiwary
- Abstract
Domain shift refers to change of distributional characteristics between the training (source) and the testing (target) datasets of a learning task, leading to performance drop. For tasks involving medical images, domain shift may be caused because of several factors such as change in underlying imaging modalities, measuring devices and staining mechanisms. Recent approaches address this issue via generative models based on the principles of adversarial learning albeit they suffer from issues such as difficulty in training and lack of diversity. Motivated by the aforementioned observations, we adapt an alternative class of deep generative models called the Energy Based Models (EBMs) for the task of unpaired image-to-image translation of medical images. Specifically, we propose a novel method called the Boundary Preserving Twin EBMs (BPT-EBM) which employs a pair of EBMs in the latent space of an Auto-Encoder trained on the source data. While one of the EBMs translates the source to the target domain the other does vice-versa along with a novel boundary preserving loss, ensuring translation symmetry and coupling between the domains. We theoretically analyze the proposed method and show that our design leads to better translation between the domains with reduced langevin mixing steps. We demonstrate the efficacy of our method through detailed quantitative and qualitative experiments on image segmentation tasks on three different datasets vis-a-vis state-of-the-art methods.
- Published
- 2022
3. Machine learning-based ABA treatment recommendation and personalization for autism spectrum disorder: an exploratory study
- Author
-
Manu, Kohli, Arpan Kumar, Kar, Anjali, Bangalore, and Prathosh, Ap
- Subjects
Neurology ,Cognitive Neuroscience ,Computer Science Applications - Abstract
Autism spectrum is a brain development condition that impairs an individual’s capacity to communicate socially and manifests through strict routines and obsessive–compulsive behavior. Applied behavior analysis (ABA) is the gold-standard treatment for autism spectrum disorder (ASD). However, as the number of ASD cases increases, there is a substantial shortage of licensed ABA practitioners, limiting the timely formulation, revision, and implementation of treatment plans and goals. Additionally, the subjectivity of the clinician and a lack of data-driven decision-making affect treatment quality. We address these obstacles by applying two machine learning algorithms to recommend and personalize ABA treatment goals for 29 study participants with ASD. The patient similarity and collaborative filtering methods predicted ABA treatment with an average accuracy of 81–84%, with a normalized discounted cumulative gain of 79–81% (NDCG) compared to clinician-prepared ABA treatment recommendations. Additionally, we assess the two models’ treatment efficacy (TE) by measuring the percentage of recommended treatment goals mastered by the study participants. The proposed treatment recommendation and personalization strategy are generalizable to other intervention methods in addition to ABA and for other brain disorders. This study was registered as a clinical trial on November 5, 2020 with trial registration number CTRI/2020/11/028933.
- Published
- 2022
4. Machine Learning-based ABA treatmentrecommendation and personalization for AutismSpectrum Disorder: An exploratory study
- Author
-
Manu kohli, Prathosh AP, Arpan Kumar Kar, and Anjali BANGALORE
- Abstract
Autism spectrum is a brain development conditionthat impairs an individual’s capacity tocommunicate socially and manifests itself throughstrict routines and obsessive-compulsive behavior.Applied behavior analysis (ABA) is thegold-standard treatment for autism spectrumdisorder (ASD). However, as the number of ASDcases increases, there is a substantial shortage oflicensed ABA practitioners, limiting the timelyformulation, revision, and implementation oftreatment plans and goals. Additionally, thesubjectivity of the clinician and a lack ofdata-driven decision-making affect treatmentquality. We address these obstacles by applying twomachine learning algorithms to recommend andpersonalize ABA treatment goals for 29 studyparticipants with ASD. The patient similarity andcollaborative filtering methods predicted ABAtreatment with an average accuracy of 81-84percent, with a normalized discounted cumulativegain of 79-81 percent (NDCG) compared toclinician-prepared ABA treatmentrecommendations. Additionally, we assess the twomodels’ treatment efficacy (TE) by measuring thepercentage of recommended treatment goalsmastered by the study participants. The proposedtreatment recommendation and personalizationstrategy are generalizable to other interventionmethods in addition to ABA and for other braindisorders. This study has been registered as aclinical trial on November 5, 2020 with trailregistration number CTRI/2020/11/028933
- Published
- 2022
5. RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns
- Author
-
Prathosh Ap and Arnab Kumar Mondal
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Speech production ,Voice activity detection ,Computer science ,Speech recognition ,Image and Video Processing (eess.IV) ,Optical flow ,Machine Learning (stat.ML) ,Electrical Engineering and Systems Science - Image and Video Processing ,Speech processing ,Signal ,Machine Learning (cs.LG) ,Sound recording and reproduction ,Statistics - Machine Learning ,FOS: Electrical engineering, electronic engineering, information engineering ,Electrical and Electronic Engineering ,Instrumentation ,Sensory cue ,Energy (signal processing) - Abstract
Voice Activity Detection (VAD) refers to the task of identification of regions of human speech in digital signals such as audio and video. While VAD is a necessary first step in many speech processing systems, it poses challenges when there are high levels of ambient noise during the audio recording. To improve the performance of VAD in such conditions, several methods utilizing the visual information extracted from the region surrounding the mouth/lip region of the speakers' video recording have been proposed. Even though these provide advantages over audio-only methods, they depend on faithful extraction of lip/mouth regions. Motivated by these, a new paradigm for VAD based on the fact that respiration forms the primary source of energy for speech production is proposed. Specifically, an audio-independent VAD technique using the respiration pattern extracted from the speakers' video is developed. The Respiration Pattern is first extracted from the video focusing on the abdominal-thoracic region of a speaker using an optical flow based method. Subsequently, voice activity is detected from the respiration pattern signal using neural sequence-to-sequence prediction models. The efficacy of the proposed method is demonstrated through experiments on a challenging dataset recorded in real acoustic environments and compared with four previous methods based on audio and visual cues., Accepted in IEEE Sensor Letters
- Published
- 2020
6. Guided Weak Supervision for Action Recognition with Scarce Data to Assess Skills of Children with Autism
- Author
-
Manu Kohli, Prashant Pandey, Prathosh Ap, and Josh Pritchard
- Subjects
FOS: Computer and information sciences ,Scarce data ,Artificial neural network ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Supervised learning ,Computer Science - Computer Vision and Pattern Recognition ,General Medicine ,Machine learning ,computer.software_genre ,medicine.disease ,ComputingMethodologies_PATTERNRECOGNITION ,medicine ,Action recognition ,Autism ,Artificial intelligence ,business ,computer ,Classifier (UML) - Abstract
Diagnostic and intervention methodologies for skill assessment of autism typically requires a clinician repetitively initiating several stimuli and recording the child's response. In this paper, we propose to automate the response measurement through video recording of the scene following the use of Deep Neural models for human action recognition from videos. However, supervised learning of neural networks demand large amounts of annotated data that are hard to come by. This issue is addressed by leveraging the `similarities' between the action categories in publicly available large-scale video action (source) datasets and the dataset of interest. A technique called guided weak supervision is proposed, where every class in the target data is matched to a class in the source data using the principle of posterior likelihood maximization. Subsequently, classifier on the target data is re-trained by augmenting samples from the matched source classes, along with a new loss encouraging inter-class separability. The proposed method is evaluated on two skill assessment autism datasets, SSBD and a real world Autism dataset comprising 37 children of different ages and ethnicity who are diagnosed with autism. Our proposed method is found to improve the performance of the state-of-the-art multi-class human action recognition models in-spite of supervision with scarce data., AAAI 2020
- Published
- 2020
7. scRAE: Deterministic Regularized Autoencoders With Flexible Priors for Clustering Single-Cell Gene Expression Data
- Author
-
Arnab Kumar Mondal, Prathosh Ap, Parag Singla, and Himanshu Asnani
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer science ,Gene Expression ,Strong prior ,Machine learning ,computer.software_genre ,Machine Learning (cs.LG) ,Data modeling ,Prior probability ,Genetics ,Cluster Analysis ,Cluster analysis ,Dropout (neural networks) ,Artificial neural network ,business.industry ,Sequence Analysis, RNA ,Applied Mathematics ,Dimensionality reduction ,Deep learning ,Gene Expression Profiling ,Artificial intelligence ,Single-Cell Analysis ,business ,computer ,Biotechnology - Abstract
Clustering single-cell RNA sequence (scRNA-seq) data poses statistical and computational challenges due to their high-dimensionality and data-sparsity, also known as `dropout' events. Recently, Regularized Auto-Encoder (RAE) based deep neural network models have achieved remarkable success in learning robust low-dimensional representations. The basic idea in RAEs is to learn a non-linear mapping from the high-dimensional data space to a low-dimensional latent space and vice-versa, simultaneously imposing a distributional prior on the latent space, which brings in a regularization effect. This paper argues that RAEs suffer from the infamous problem of bias-variance trade-off in their naive formulation. While a simple AE without a latent regularization results in data over-fitting, a very strong prior leads to under-representation and thus bad clustering. To address the above issues, we propose a modified RAE framework (called the scRAE) for effective clustering of the single-cell RNA sequencing data. scRAE consists of deterministic AE with a flexibly learnable prior generator network, which is jointly trained with the AE. This facilitates scRAE to trade-off better between the bias and variance in the latent space. We demonstrate the efficacy of the proposed method through extensive experimentation on several real-world single-cell Gene expression datasets., Comment: IEEE/ACM Transactions on Computational Biology and Bioinformatics
- Published
- 2021
8. xViTCOS: Explainable Vision Transformer Based COVID-19 Screening Using Radiography
- Author
-
Parag Singla, Arnab Bhattacharjee, Arnab Kumar Mondal, and Prathosh Ap
- Subjects
Coronavirus disease 2019 (COVID-19) ,Computer science ,Radiography ,Computer applications to medicine. Medical informatics ,Biomedical Engineering ,R858-859.7 ,Context (language use) ,Virus diseases ,Machine learning ,computer.software_genre ,vision transformer ,Article ,CT scan and CXR ,Medical technology ,Humans ,AI for COVID-19 detection ,R855-855.5 ,Transformer (machine learning model) ,SARS-CoV-2 ,Inductive bias ,business.industry ,X-Rays ,Deep learning ,COVID-19 ,deep learning ,General Medicine ,Radiography, Thoracic ,Artificial intelligence ,Transfer of learning ,business ,computer - Abstract
Objective: Since its outbreak, the rapid spread of COrona VIrus Disease 2019 (COVID-19) across the globe has pushed the health care system in many countries to the verge of collapse. Therefore, it is imperative to correctly identify COVID-19 positive patients and isolate them as soon as possible to contain the spread of the disease and reduce the ongoing burden on the healthcare system. The primary COVID-19 screening test, RT-PCR although accurate and reliable, has a long turn-around time. In the recent past, several researchers have demonstrated the use of Deep Learning (DL) methods on chest radiography (such as X-ray and CT) for COVID-19 detection. However, existing CNN based DL methods fail to capture the global context due to their inherent image-specific inductive bias. Methods: Motivated by this, in this work, we propose the use of vision transformers (instead of convolutional networks) for COVID-19 screening using the X-ray and CT images. We employ a multi-stage transfer learning technique to address the issue of data scarcity. Furthermore, we show that the features learned by our transformer networks are explainable. Results: We demonstrate that our method not only quantitatively outperforms the recent benchmarks but also focuses on meaningful regions in the images for detection (as confirmed by Radiologists), aiding not only in accurate diagnosis of COVID-19 but also in localization of the infected area. The code for our implementation can be found here - https://github.com/arnabkmondal/xViTCOS. Conclusion: The proposed method will help in timely identification of COVID-19 and efficient utilization of limited resources.
- Published
- 2021
9. Variational Inference with Latent Space Quantization for Adversarial Resilience
- Author
-
Prathosh Ap, Deepak Mishra, and Vinay Kyatham
- Subjects
Artificial neural network ,business.industry ,Computer science ,05 social sciences ,Inference ,010501 environmental sciences ,Lipschitz continuity ,01 natural sciences ,Manifold ,Data modeling ,0502 economics and business ,Classifier (linguistics) ,Code (cryptography) ,Artificial intelligence ,050207 economics ,White box ,business ,Computer Science::Cryptography and Security ,0105 earth and related environmental sciences - Abstract
Despite their tremendous success in modelling high-dimensional data manifolds, deep neural networks suffer from the threat of adversarial attacks - Existence of perceptually valid input-like samples obtained through careful perturbation that lead to degradation in the performance of the underlying model. Major concerns with existing defense mechanisms include non-generalizability across different attacks, models and large inference time. In this paper, we propose a generalized defense mechanism capitalizing on the expressive power of regularized latent space based generative models. We design an adversarial filter, devoid of access to classifier and adversaries, which makes it usable in tandem with any classifier. The basic idea is to learn a Lipschitz constrained mapping from the data manifold, incorporating adversarial perturbations, to a quantized latent space and re-map it to the true data manifold. Specifically, we simultaneously auto-encode the data manifold and its perturbations implicitly through the perturbations of the regularized and quantized generative latent space, realized using variational inference. We demonstrate the efficacy of the proposed formulation in providing resilience against multiple attack types (black and white box) and methods, while being almost real-time. Our experiments show that the proposed method surpasses the state-of-the-art techniques in several cases. The implementation code is available at - https://github.com/mayank31398/lqvae.
- Published
- 2021
10. Unsupervised Domain Adaptation for Semantic Segmentation of NIR Images Through Generative Latent Search
- Author
-
Prashant Pandey, Prathosh Ap, Sameer Ambekar, and Aayush Kumar Tyagi
- Subjects
Pixel ,Computer science ,business.industry ,Deep learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Inference ,Pattern recognition ,Domain (software engineering) ,Image (mathematics) ,Generative model ,Code (cryptography) ,Segmentation ,Artificial intelligence ,business - Abstract
Segmentation of the pixels corresponding to human skin is an essential first step in multiple applications ranging from surveillance to heart-rate estimation from remote-photoplethysmography. However, the existing literature considers the problem only in the visible-range of the EM-spectrum which limits their utility in low or no light settings where the criticality of the application is higher. To alleviate this problem, we consider the problem of skin segmentation from the Near-infrared images. However, Deep learning based state-of-the-art segmentation techniques demands large amounts of labelled data that is unavailable for the current problem. Therefore we cast the skin segmentation problem as that of target-independent Unsupervised Domain Adaptation (UDA) where we use the data from the Red-channel of the visible-range to develop skin segmentation algorithm on NIR images. We propose a method for target-independent segmentation where the ‘nearest-clone’ of a target image in the source domain is searched and used as a proxy in the segmentation network trained only on the source domain. We prove the existence of ‘nearest-clone’ and propose a method to find it through an optimization algorithm over the latent space of a Deep generative model based on variational inference. We demonstrate the efficacy of the proposed method for NIR skin segmentation over the state-of-the-art UDA segmentation methods on the two newly created skin segmentation datasets in NIR domain despite not having access to the target NIR data. Additionally, we report state-of-the-art results for adaption from Synthia to Cityscapes which is a popular setting in Unsupervised Domain Adaptation for semantic segmentation. The code and datasets are available at https://github.com/ambekarsameer96/GLSS.
- Published
- 2020
11. Abstract PO-036: A sophisticated bioinformatics framework for integrative study of radiomics and genomics profiles of tumors
- Author
-
Sivakumaran Theru Arumugam, Prathosh Ap, Aayush Tyagi, Shivashankar H. Nagaraj, Olivier Gevaert, Shrey S. Sukhadia, and Pritam Mukherjee
- Subjects
Cancer Research ,Oncology ,Radiomics ,Genomics ,Computational biology ,Biology - Abstract
The potential for radiomics to support oncology decision-making has grown substantially in recent years, as these scanning techniques have been found to offer unique information regarding the tumor phenotype and microenvironment that is distinct from that provided by genomic or proteomic assays. Radiomic and genomic (or proteomic) data can be correlated with one another, thereby facilitating radiogenomic efforts. Radiogenomically-informed biopsies have the potential to yield better pathological outcomes and can aid in the planning of more appropriate treatment strategies for cancer patients. However, the field lacks a unified software platform wherein radiomic and genomics/proteomic data could be brought together to conduct a variety of correlational analyses and build robust artificial intelligence models that would aid the prediction of genomic/proteomic profiles of tumors from their radiological images. We have built such a comprehensive platform that could be utilized by scientists and clinicians globally to conduct radiogenomic studies for a variety of cancer types, and further validate and deploy it in clinics to aid effective monitoring, diagnosis, and treatment of cancer patients. Citation Format: Shrey S. Sukhadia, Shivashankar H. Nagaraj, Olivier Gevaert, Sivakumaran Theru Arumugam, Aayush Tyagi, Pritam Mukherjee, A.P. Prathosh. A sophisticated bioinformatics framework for integrative study of radiomics and genomics profiles of tumors [abstract]. In: Proceedings of the AACR Virtual Special Conference on Artificial Intelligence, Diagnosis, and Imaging; 2021 Jan 13-14. Philadelphia (PA): AACR; Clin Cancer Res 2021;27(5_Suppl):Abstract nr PO-036.
- Published
- 2021
12. Prediction and imputation in irregularly sampled clinical time series data using hierarchical linear dynamical models
- Author
-
Abhishek Sengupta, Prathosh Ap, Satya Narayan Shukla, Chandan K. Reddy, and Vaibhav Rajan
- Subjects
State variable ,Computer science ,0206 medical engineering ,Linear model ,02 engineering and technology ,computer.software_genre ,020601 biomedical engineering ,01 natural sciences ,Linear dynamical system ,010104 statistics & probability ,Linear Models ,Data mining ,Imputation (statistics) ,0101 mathematics ,Time series ,Temporal difference learning ,computer ,Algorithm - Abstract
Clinical time series, comprising of repeated clinical measurements provide valuable information of the trajectory of patients' condition. Linear dynamical systems (LDS) are used extensively in science and engineering for modeling time series data. The observation and state variables in LDS are assumed to be uniformly sampled in time with a fixed sampling rate. The observation sequence for clinical time series is often irregularly sampled and LDS do not model such data well. In this paper, we develop two LDS-based models for irregularly sampled data. The key idea is to incorporate a temporal difference variable within the state equations of LDS whose parameters are estimated using observed data. Our models are evaluated on prediction and imputation tasks using real irregularly sampled clinical time series data and are found to outperform state-of-the-art techniques.
- Published
- 2017
13. Generalization on Unseen Domains via Inference-time Label-Preserving Target Projections
- Author
-
Mrigank Raman, Prathosh Ap, Prashant Pandey, and Sumanth Varambally
- Subjects
Generative model ,Optimization problem ,Source data ,business.industry ,Generalization ,Computer science ,Metric (mathematics) ,Feature (machine learning) ,Inference ,Pattern recognition ,Artificial intelligence ,business ,Classifier (UML) - Abstract
Generalization of machine learning models trained on a set of source domains on unseen target domains with different statistics, is a challenging problem. While many approaches have been proposed to solve this problem, they only utilize source data during training but do not take advantage of the fact that a single target example is available at the time of inference. Motivated by this, we propose a method that effectively uses the target sample during inference beyond mere classification. Our method has three components - (i) A label-preserving feature or metric transformation on source data such that the source samples are clustered in accordance with their class irrespective of their domain (ii) A generative model trained on the these features (iii) A label-preserving projection of the target point on the source-feature manifold during inference via solving an optimization problem on the input space of the generative model using the learned metric. Finally, the projected target is used in the classifier. Since the projected target feature comes from the source manifold and has the same label as the real target by design, the classifier is expected to perform better on it than the true target. We demonstrate that our method outperforms the state-of-the-art Domain Generalization methods on multiple datasets and tasks.
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.