Back to Search Start Over

A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments.

Authors :
Pan, Shaojun
Zhu, Chengkai
Zhao, Xing-Ming
Coelho, Luis Pedro
Source :
Nature Communications; 4/28/2022, Vol. 13 Issue 1, p1-12, 12p
Publication Year :
2022

Abstract

Metagenomic binning is the step in building metagenome-assembled genomes (MAGs) when sequences predicted to originate from the same genome are automatically grouped together. The most widely-used methods for binning are reference-independent, operating de novo and enable the recovery of genomes from previously unsampled clades. However, they do not leverage the knowledge in existing databases. Here, we introduce SemiBin, an open source tool that uses deep siamese neural networks to implement a semi-supervised approach, i.e. SemiBin exploits the information in reference genomes, while retaining the capability of reconstructing high-quality bins that are outside the reference dataset. Using simulated and real microbiome datasets from several different habitats from GMGCv1 (Global Microbial Gene Catalog), including the human gut, non-human guts, and environmental habitats (ocean and soil), we show that SemiBin outperforms existing state-of-the-art binning methods. In particular, compared to other methods, SemiBin returns more high-quality bins with larger taxonomic diversity, including more distinct genera and species. Here, the authors present SemiBin, a siamese deep neural network framework that incorporates information from reference genomes, able to extract better metagenome-assembled genomes (MAGs) in several host-associated and environmental habitats. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20411723
Volume :
13
Issue :
1
Database :
Complementary Index
Journal :
Nature Communications
Publication Type :
Academic Journal
Accession number :
156745015
Full Text :
https://doi.org/10.1038/s41467-022-29843-y