Back to Search
Start Over
DEPP: Deep Learning Enables Extending Species Trees using Single Genes.
- Source :
-
Systematic biology [Syst Biol] 2023 May 19; Vol. 72 (1), pp. 17-34. - Publication Year :
- 2023
-
Abstract
- Placing new sequences onto reference phylogenies is increasingly used for analyzing environmental samples, especially microbiomes. Existing placement methods assume that query sequences have evolved under specific models directly on the reference phylogeny. For example, they assume single-gene data (e.g., 16S rRNA amplicons) have evolved under the GTR model on a gene tree. Placement, however, often has a more ambitious goal: extending a (genome-wide) species tree given data from individual genes without knowing the evolutionary model. Addressing this challenging problem requires new directions. Here, we introduce Deep-learning Enabled Phylogenetic Placement (DEPP), an algorithm that learns to extend species trees using single genes without prespecified models. In simulations and on real data, we show that DEPP can match the accuracy of model-based methods without any prior knowledge of the model. We also show that DEPP can update the multilocus microbial tree-of-life with single genes with high accuracy. We further demonstrate that DEPP can combine 16S and metagenomic data onto a single tree, enabling community structure analyses that take advantage of both sources of data. [Deep learning; gene tree discordance; metagenomics; microbiome analyses; neural networks; phylogenetic placement.].<br /> (© The Author(s) 2022. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
- Subjects :
- Phylogeny
RNA, Ribosomal, 16S genetics
Algorithms
Deep Learning
Microbiota genetics
Subjects
Details
- Language :
- English
- ISSN :
- 1076-836X
- Volume :
- 72
- Issue :
- 1
- Database :
- MEDLINE
- Journal :
- Systematic biology
- Publication Type :
- Academic Journal
- Accession number :
- 35485976
- Full Text :
- https://doi.org/10.1093/sysbio/syac031