1. Impact of library preparation protocols and template quantity on the metagenomic reconstruction of a mock microbial community
- Author
-
Alicia Clum, Doina Ciobanu, Susannah G. Tringe, Tanja Woyke, Robert M. Bowers, Joanne Lim, Jan Fang Cheng, Hope Tice, Kanwar Singh, and Chew Yee Ngan
- Subjects
Bioinformatics ,Genomics ,Computational biology ,Biology ,Genome ,Medical and Health Sciences ,Contig Mapping ,Information and Computing Sciences ,Genetics ,Genomic library ,Biomass ,Gene Library ,Low biomass ,Library preparation protocol ,Base Composition ,Bacteria ,Shotgun sequencing ,Methodology Article ,Microbiota ,Human Genome ,DNA ,Sequence Analysis, DNA ,Biological Sciences ,Archaea ,Microbial population biology ,Metagenomics ,Low input ,Metagenome ,Microbiome ,DNA microarray ,Sequence Analysis ,GC-content ,Biotechnology - Abstract
Background The rapid development of sequencing technologies has provided access to environments that were either once thought inhospitable to life altogether or that contain too few cells to be analyzed using genomics approaches. While 16S rRNA gene microbial community sequencing has revolutionized our understanding of community composition and diversity over time and space, it only provides a crude estimate of microbial functional and metabolic potential. Alternatively, shotgun metagenomics allows comprehensive sampling of all genetic material in an environment, without any underlying primer biases. Until recently, one of the major bottlenecks of shotgun metagenomics has been the requirement for large initial DNA template quantities during library preparation. Results Here, we investigate the effects of varying template concentrations across three low biomass library preparation protocols on their ability to accurately reconstruct a mock microbial community of known composition. We analyze the effects of input DNA quantity and library preparation method on library insert size, GC content, community composition, assembly quality and metagenomic binning. We found that library preparation method and the amount of starting material had significant impacts on the mock community metagenomes. In particular, GC content shifted towards more GC rich sequences at the lower input quantities regardless of library prep method, the number of low quality reads that could not be mapped to the reference genomes increased with decreasing input quantities, and the different library preparation methods had an impact on overall metagenomic community composition. Conclusions This benchmark study provides recommendations for library creation of representative and minimally biased metagenome shotgun sequencing, enabling insights into functional attributes of low biomass ecosystem microbial communities. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2063-6) contains supplementary material, which is available to authorized users.
- Published
- 2015