Back to Search
Start Over
Technology-specific error signatures in the 1000 Genomes Project data
- Source :
- Human genetics. 130(4)
- Publication Year :
- 2010
-
Abstract
- Next-generation sequencing (NGS) will likely facilitate a better understanding of the causes and consequences of human genetic variability. In this context, the validity of NGS-inferred single-nucleotide variants (SNVs) is of paramount importance. We therefore developed a statistical framework to assess the fidelity of three common NGS platforms. Using aligned DNA sequence data from two completely sequenced HapMap samples as included in the 1000 Genomes Project, we unraveled remarkably different error profiles for the three platforms. Compared to confirmed HapMap variants, newly identified SNVs included a substantial proportion of false positives (3–17%). Consensus calling by more than one platform yielded significantly lower error rates (1–4%). This implies that the use of multiple NGS platforms may be more cost-efficient than relying upon a single technology alone, particularly in physically localized sequencing experiments that rely upon small error rates. Our study thus highlights that different NGS platforms suit different practical applications differently well, and that NGS-based studies require stringent data quality control for their results to be valid.
- Subjects :
- Genetics
Quality Control
Genome, Human
media_common.quotation_subject
Fidelity
Genetic Variation
High-Throughput Nucleotide Sequencing
Context (language use)
Computational biology
Sequence Analysis, DNA
Biology
DNA sequencing
Data quality
Human Genome Project
False positive paradox
Humans
Genetic variability
1000 Genomes Project
International HapMap Project
Artifacts
Genetics (clinical)
Algorithms
media_common
Subjects
Details
- ISSN :
- 14321203
- Volume :
- 130
- Issue :
- 4
- Database :
- OpenAIRE
- Journal :
- Human genetics
- Accession number :
- edsair.doi.dedup.....950a7c8aa225aa593b12f1f107bb4145