Back to Search
Start Over
Exome sequence read depth methods for identifying copy number changes
- Source :
- Briefings in Bioinformatics. 16:380-392
- Publication Year :
- 2014
- Publisher :
- Oxford University Press (OUP), 2014.
-
Abstract
- Copy number variants (CNVs) play important roles in a number of human diseases and in pharmacogenetics. Powerful methods exist for CNV detection in whole genome sequencing (WGS) data, but such data are costly to obtain. Many disease causal CNVs span or are found in genome coding regions (exons), which makes CNV detection using whole exome sequencing (WES) data attractive. If reliably validated against WGS-based CNVs, exome-derived CNVs have potential applications in a clinical setting. Several algorithms have been developed to exploit exome data for CNV detection and comparisons made to find the most suitable methods for particular data samples. The results are not consistent across studies. Here, we review some of the exome CNV detection methods based on depth of coverage profiles and examine their performance to identify problems contributing to discrepancies in published results. We also present a streamlined strategy that uses a single metric, the likelihood ratio, to compare exome methods, and we demonstrated its utility using the VarScan 2 and eXome Hidden Markov Model (XHMM) programs using paired normal and tumour exome data from chronic lymphocytic leukaemia patients. We use array-based somatic CNV (SCNV) calls as a reference standard to compute prevalence-independent statistics, such as sensitivity, specificity and likelihood ratio, for validation of the exome-derived SCNVs. We also account for factors known to influence the performance of exome read depth methods, such as CNV size and frequency, while comparing our findings with published results.
- Subjects :
- DNA Copy Number Variations
Computer science
Molecular Sequence Data
Read depth
Computational biology
Sensitivity and Specificity
Genome
Pattern Recognition, Automated
Humans
Exome
Copy-number variation
Hidden Markov model
Molecular Biology
Exome sequencing
Sequence (medicine)
Genetics
Whole genome sequencing
Base Sequence
Chromosome Mapping
Reproducibility of Results
DNA, Neoplasm
Sequence Analysis, DNA
Leukemia, Lymphocytic, Chronic, B-Cell
Data Interpretation, Statistical
Algorithms
Information Systems
Subjects
Details
- ISSN :
- 14774054 and 14675463
- Volume :
- 16
- Database :
- OpenAIRE
- Journal :
- Briefings in Bioinformatics
- Accession number :
- edsair.doi.dedup.....eb28ca7904ff0267354e9f18ec8640b5
- Full Text :
- https://doi.org/10.1093/bib/bbu027