Back to Search Start Over

Clustering and graph mining techniques for classification of complex structural variations in cancer genomes

Authors :
Gonzalo Gomez-Sanchez
Luisa Delgado-Serrano
David Carrera
David Torrents
Josep Ll. Berral
Source :
Scientific Reports, Vol 12, Iss 1, Pp 1-12 (2022)
Publication Year :
2022
Publisher :
Nature Portfolio, 2022.

Abstract

Abstract For many years, a major question in cancer genomics has been the identification of those variations that can have a functional role in cancer, and distinguish from the majority of genomic changes that have no functional consequences. This is particularly challenging when considering complex chromosomal rearrangements, often composed of multiple DNA breaks, resulting in difficulties in classifying and interpreting them functionally. Despite recent efforts towards classifying structural variants (SVs), more robust statistical frames are needed to better classify these variants and isolate those that derive from specific molecular mechanisms. We present a new statistical approach to analyze SVs patterns from 2392 tumor samples from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium and identify significant recurrence, which can inform relevant mechanisms involved in the biology of tumors. The method is based on recursive KDE clustering of 152,926 SVs, randomization methods, graph mining techniques and statistical measures. The proposed methodology was able not only to identify complex patterns across different cancer types but also to prove them as not random occurrences. Furthermore, a new class of pattern that was not previously described has been identified.

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
20452322
Volume :
12
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Scientific Reports
Publication Type :
Academic Journal
Accession number :
edsdoj.4caee8fb9fbe436198b673e1a541cfee
Document Type :
article
Full Text :
https://doi.org/10.1038/s41598-022-07211-6