1. Accurate, multi-kb reads resolve complex populations and detect rare microorganisms
- Author
-
Sharon, I, Kertesz, M, Hug, LA, Pushkarev, D, Blauwkamp, TA, Castelle, CJ, Amirebrahimi, M, Thomas, BC, Burstein, D, Tringe, SG, Williams, KH, and Banfield, JF
- Subjects
Deltaproteobacteria ,Geologic Sediments ,Genome ,Base Sequence ,Hydrolases ,Bioinformatics ,Microbial Consortia ,Human Genome ,Bacterial ,Chloroflexi ,DNA ,Biodiversity ,Biological Sciences ,Medical and Health Sciences ,Glucose ,Bacterial Proteins ,Genetics ,Metagenomics ,Sequence Analysis - Abstract
© 2015 Sharon et al. Accurate evaluation of microbial communities is essential for understanding global biogeochemical processes and can guide bioremediation and medical treatments. Metagenomics is most commonly used to analyze microbial diversity and metabolic potential, but assemblies of the short reads generated by current sequencing platforms may fail to recover heterogeneous strain populations and rare organisms. Here we used short (150-bp) and long (multi-kb) synthetic reads to evaluate strain heterogeneity and study microorganisms at low abundance in complex microbial communities from terrestrial sediments. The long-read data revealed multiple (probably dozens of) closely related species and strains from previously undescribed Deltaproteobacteria and Aminicenantes (candidate phylum OP8). Notably, these are the most abundant organisms in the communities, yet short-read assemblies achieved only partial genome coverage, mostly in the form of short scaffolds (N50 = ∼2200 bp). Genome architecture and metabolic potential for these lineages were reconstructed using a new synteny-based method. Analysis of long-read data also revealed thousands of species whose abundances were
- Published
- 2015
- Full Text
- View/download PDF