Back to Search
Start Over
Using somatic variant richness to mine signals from rare variants in the cancer genome
- Source :
- Nature Communications, Vol 10, Iss 1, Pp 1-9 (2019), Nature Communications
- Publication Year :
- 2019
- Publisher :
- Nature Portfolio, 2019.
-
Abstract
- To date, the vast preponderance of somatic variants observed in the cancer genome have been rare variants, and it is common in practice to encounter in a new tumor variants that have not been observed previously. Here we focus on probability estimation for encountering such hitherto unseen variants. We draw upon statistical methodology that has been developed in other fields of study, notably in species estimation in ecology, and word frequency estimation in computational linguistics. Analysis of whole-exome and targeted panel sequencing data sets reveal substantial variability in variant “richness” between genes that could be harnessed for clinically relevant problems. We quantify the variant-tissue association and show a strong gene-specific, lineage-dependent pattern of encountering new variants. This variability is largely determined by the proportion of observed variants that are rare. Our findings suggest that variants that occur at very low frequencies can harbor important signals that are clinically consequential.<br />Sequencing cancer genomes reveals low frequency novel somatic variants without known function. Here, the authors leverage statistical methodology from the fields of computational linguistics and ecology to highlight the potentially important signals harboured by these novel variants that are often dismissed.
- Subjects :
- 0301 basic medicine
Statistical methods
Somatic cell
Ecology (disciplines)
Science
General Physics and Astronomy
Biology
01 natural sciences
Genome
Article
General Biochemistry, Genetics and Molecular Biology
010104 statistics & probability
03 medical and health sciences
Phylogenetics
Neoplasms
Cancer genome
Cancer genomics
Humans
Genetic Predisposition to Disease
0101 mathematics
lcsh:Science
Gene
Phylogeny
Probability
Multidisciplinary
Genome, Human
Reproducibility of Results
General Chemistry
Word lists by frequency
030104 developmental biology
Organ Specificity
Evolutionary biology
Mutation
lcsh:Q
Species richness
Subjects
Details
- Language :
- English
- ISSN :
- 20411723
- Volume :
- 10
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- Nature Communications
- Accession number :
- edsair.doi.dedup.....114ccb35995384e196c289cd95e7e344