Thuc Duy Le, Antonio Colaprico, Gianluca Bontempi, Rujing Wang, Su Ning, Lin Liu, Bingyu Sun, Taosheng Xu, Jiuyong Li, Xu, Taosheng, Le, Thuc Duy, Liu, Lin, Su, Ning, Wang, Rujing, Sun, Bingyu, Colaprico, Antonio, Bontempi, Gianluca, and Li, Jiuyong
Summary Identifying molecular cancer subtypes from multi-omics data is an important step in the personalized medicine. We introduce CancerSubtypes, an R package for identifying cancer subtypes using multi-omics data, including gene expression, miRNA expression and DNA methylation data. CancerSubtypes integrates four main computational methods which are highly cited for cancer subtype identification and provides a standardized framework for data pre-processing, feature selection, and result follow-up analyses, including results computing, biology validation and visualization. The input and output of each step in the framework are packaged in the same data format, making it convenience to compare different methods. The package is useful for inferring cancer subtypes from an input genomic dataset, comparing the predictions from different well-known methods and testing new subtype discovery methods, as shown with different application scenarios in the Supplementary Material. Availability and implementation The package is implemented in R and available under GPL-2 license from the Bioconductor website (http://bioconductor.org/packages/CancerSubtypes/). Supplementary information Supplementary data are available at Bioinformatics online.