Back to Search
Start Over
OncodriveCLUSTL: a sequence-based clustering method to identify cancer drivers
- Source :
- Bioinformatics
- Publication Year :
- 2019
- Publisher :
- Oxford University Press, 2019.
-
Abstract
- Motivation: Identification of the genomic alterations driving tumorigenesis is one of the main goals in oncogenomics research. Given the evolutionary principles of cancer development, computational methods that detect signals of positive selection in the pattern of tumor mutations have been effectively applied in the search for cancer genes. One of these signals is the abnormal clustering of mutations, which has been shown to be complementary to other signals in the detection of driver genes. Results: We have developed OncodriveCLUSTL, a new sequence-based clustering algorithm to detect significant clustering signals across genomic regions. OncodriveCLUSTL is based on a local background model derived from the simulation of mutations accounting for the composition of tri- or penta-nucleotide context substitutions observed in the cohort under study. Our method can identify known clusters and bona-fide cancer drivers across cohorts of tumor whole-exomes, outperforming the existing OncodriveCLUST algorithm and complementing other methods based on different signals of positive selection. Our results indicate that OncodriveCLUSTL can be applied to the analysis of non-coding genomic elements and non-human mutations data. Availability and implementation: OncodriveCLUSTL is available as an installable Python 3.5 package. The source code and running examples are freely available at https://bitbucket.org/bbglab/oncodriveclustl under GNU Affero General Public License. Supplementary information: Supplementary data are available at Bioinformatics online. This work was supported by funding from the Spanish Ministry of Economy and Competitiveness [SAF2015-66084-R, MINECO/FEDER, UE] and by the European Research Council [Consolidator Grant 68239]. IRB Barcelona is the recipient of a Severo Ochoa Centre of Excellence Award from the Spanish Ministry of Economy and Competitiveness (MINECO; Government of Spain) and is supported by CERCA (Generalitat de Catalunya). A.G.-P. is supported by a Ramón y Cajal contract from the Spanish Ministry of Economy and Competitiveness [RYC-2013-1455]. C.A.-P. is supported by “la Caixa” Foundation (ID 100010434) with code [LCF/BQ/ES18/11670011].
- Subjects :
- Statistics and Probability
Computer science
media_common.quotation_subject
Library science
Context (language use)
Computational biology
Oncogenomics
medicine.disease_cause
Biochemistry
030218 nuclear medicine & medical imaging
03 medical and health sciences
0302 clinical medicine
Excellence
Political science
Neoplasms
medicine
Cluster Analysis
Humans
Cluster analysis
Molecular Biology
media_common
030304 developmental biology
Government
0303 health sciences
Sequence
European research
Cancer
Genomics
Genome Analysis
medicine.disease
Applications Notes
Corrigenda
Computer Science Applications
Computational Mathematics
Identification (information)
Computational Theory and Mathematics
030220 oncology & carcinogenesis
Christian ministry
Carcinogenesis
Software
Subjects
Details
- Language :
- English
- ISSN :
- 13674811 and 13674803
- Volume :
- 35
- Issue :
- 24
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....1afc77c4158c9ffaef765c969de2b4e3