Back to Search
Start Over
Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage
- Source :
- Nucleic Acids Research, Chakraborty, M; Baldwin-Brown, JG; Long, AD; & Emerson, JJ. (2016). Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage. NUCLEIC ACIDS RESEARCH, 44(19). doi: 10.1093/nar/gkw654. UC Irvine: Retrieved from: http://www.escholarship.org/uc/item/0t15p040, Nucleic acids research, vol 44, iss 19
- Publication Year :
- 2015
- Publisher :
- Cold Spring Harbor Laboratory, 2015.
-
Abstract
- Genome assemblies that are accurate, complete, and contiguous are essential for identifying important structural and functional elements of genomes and for identifying genetic variation. Nevertheless, most recent genome assemblies remain incomplete and fragmented. While long molecule sequencing promises to deliver more complete genome assemblies with fewer gaps, concerns about error rates, low yields, stringent DNA requirements, and uncertainty about best practices may discourage many investigators from adopting this technology. Here, in conjunction with the platinum standard Drosophila melanogaster reference genome, we analyze recently published long molecule sequencing data to identify what governs completeness and contiguity of genome assemblies. We also present a hybrid meta-assembly approach that achieves remarkable assembly contiguity for both Drosophila and human assemblies with only modest long molecule sequencing coverage. Our results motivate a set of preliminary best practices for obtaining accurate and contiguous assemblies, a “missing manual” that guides key decisions in building high quality de novo genome assemblies, from DNA isolation to polishing the assembly.
- Subjects :
- 0106 biological sciences
0301 basic medicine
Computer science
Contiguity
Sequencing data
Sequence assembly
Hybrid genome assembly
Genomics
Computational biology
Biology
01 natural sciences
Genome
Cell Line
Contiguity (probability theory)
03 medical and health sciences
chemistry.chemical_compound
0302 clinical medicine
Information and Computing Sciences
Genetic variation
Genetics
Animals
Humans
030304 developmental biology
0303 health sciences
Human Genome
Computational Biology
High-Throughput Nucleotide Sequencing
DNA
Sequence Analysis, DNA
Biological Sciences
DNA extraction
Drosophila melanogaster
030104 developmental biology
chemistry
Methods Online
Generic health relevance
Sequence Analysis
Environmental Sciences
030217 neurology & neurosurgery
Developmental Biology
010606 plant biology & botany
Reference genome
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Nucleic Acids Research, Chakraborty, M; Baldwin-Brown, JG; Long, AD; & Emerson, JJ. (2016). Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage. NUCLEIC ACIDS RESEARCH, 44(19). doi: 10.1093/nar/gkw654. UC Irvine: Retrieved from: http://www.escholarship.org/uc/item/0t15p040, Nucleic acids research, vol 44, iss 19
- Accession number :
- edsair.doi.dedup.....33284b7a6ed9fc6eb82c3435dbca9c43