Back to Search
Start Over
Identification of single nucleotide variants using position-specific error estimation in deep sequencing data
- Source :
- BMC Medical Genomics, Vol 12, Iss 1, Pp 1-12 (2019), BMC Medical Genomics
- Publication Year :
- 2019
- Publisher :
- BMC, 2019.
-
Abstract
- BackgroundTargeted deep sequencing is a highly effective technology to identify known and novel single nucleotide variants (SNVs) with many applications in translational medicine, disease monitoring and cancer profiling. However, identification of SNVs using deep sequencing data is a challenging computational problem as different sequencing artifacts limit the analytical sensitivity of SNV detection, especially at low variant allele frequencies (VAFs).MethodsTo address the problem of relatively high noise levels in amplicon-based deep sequencing data (e.g. with the Ion AmpliSeq technology) in the context of SNV calling, we have developed a new bioinformatics tool called AmpliSolve. AmpliSolve uses a set of normal samples to model position-specific, strand-specific and nucleotide-specific background artifacts (noise), and deploys a Poisson model-based statistical framework for SNV detection.ResultsOur tests on both synthetic and real data indicate that AmpliSolve achieves a good trade-off between precision and sensitivity, even at VAF below 5% and as low as 1%. We further validate AmpliSolve by applying it to the detection of SNVs in 96 circulating tumor DNA samples at three clinically relevant genomic positions and compare the results to digital droplet PCR experiments.ConclusionsAmpliSolve is a new tool for in-silico estimation of background noise and for detection of low frequency SNVs in targeted deep sequencing data. Although AmpliSolve has been specifically designed for and tested on amplicon-based libraries sequenced with the Ion Torrent platform it can, in principle, be applied to other sequencing platforms as well. AmpliSolve is freely available at https://github.com/dkleftogi/AmpliSolve.
- Subjects :
- lcsh:Internal medicine
Deep sequencing
lcsh:QH426-470
Computer science
Computational biology
Polymorphism, Single Nucleotide
Circulating Tumor DNA
Liquid biopsies
03 medical and health sciences
0302 clinical medicine
Variant calling
Cancer genomics
Humans
Nucleotide
Error correction
lcsh:RC31-1245
030304 developmental biology
chemistry.chemical_classification
0303 health sciences
Ion torrent
High-Throughput Nucleotide Sequencing
Reproducibility of Results
Variant allele
Ion semiconductor sequencing
Amplicon
3. Good health
Benchmarking
lcsh:Genetics
Next generation sequencing (NGS)
Technical Advance
chemistry
Circulating tumor DNA
030220 oncology & carcinogenesis
Targeted sequencing
Subjects
Details
- Language :
- English
- ISSN :
- 17558794
- Volume :
- 12
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Medical Genomics
- Accession number :
- edsair.doi.dedup.....c89ceb288e1b89bdab4cf243f38b2729
- Full Text :
- https://doi.org/10.1186/s12920-019-0557-9