101. Efficient and Accurate Inference of Mixed Microbial Population Trajectories from Longitudinal Count Data
- Author
-
Tyler A. Joseph, Amey P. Pasarkar, and Itsik Pe'er
- Subjects
Data Analysis ,Histology ,Computer science ,Population ,Inference ,Astrophysics::Cosmology and Extragalactic Astrophysics ,computer.software_genre ,Pathology and Forensic Medicine ,03 medical and health sciences ,0302 clinical medicine ,Humans ,Microbiome ,Longitudinal Studies ,education ,Astrophysics::Galaxy Astrophysics ,030304 developmental biology ,0303 health sciences ,education.field_of_study ,Orders of magnitude (acceleration) ,Microbiota ,Sampling (statistics) ,Cell Biology ,Quantitative Biology::Genomics ,Research Design ,Noise (video) ,Data mining ,computer ,030217 neurology & neurosurgery ,Human Microbiome Project ,Count data - Abstract
Summary The recently completed second phase of the Human Microbiome Project has highlighted the relationship between dynamic changes in the microbiome and disease, motivating new microbiome study designs based on longitudinal sampling. Yet, analysis of such data is hindered by presence of technical noise, high dimensionality, and data sparsity. Here, we introduce LUMINATE (longitudinal microbiome inference and zero detection), a fast and accurate method for inferring relative abundances from noisy read count data. We demonstrate that LUMINATE is orders of magnitude faster than current approaches, with better or similar accuracy. We further show that LUMINATE can accurately distinguish biological zeros, when a taxon is absent from the community, from technical zeros, when a taxon is below the detection threshold. We conclude by demonstrating the utility of LUMINATE on a real dataset, showing that LUMINATE smooths trajectories observed from noisy data. LUMINATE is freely available from https://github.com/tyjo/luminate .
- Published
- 2020