Back to Search
Start Over
Driver Missense Mutation Identification Using Feature Selection and Model Fusion
- Source :
- Journal of Computational Biology. 22:1075-1085
- Publication Year :
- 2015
- Publisher :
- Mary Ann Liebert Inc, 2015.
-
Abstract
- Driver mutations propel oncogenesis and occur much less frequently than passenger mutations. The need for automatic and accurate identification of driver mutations has increased dramatically with the exponential growth of mutation data. Current computational solutions to identify driver mutations rely on sequence homology. Here we construct a machine learning-based framework that does not rely on sequence homology or domain knowledge to predict driver missense mutations. A windowing approach to represent the local environment of the sequence around the mutation point as a mutation sample is applied, followed by extraction of three sequence-level features from each sample. After selecting the most significant features, the support vector machine and multimodal fusion strategies are employed to give final predictions. The proposed framework achieves relatively high performance and outperforms current state-of-the-art algorithms. The ease of deploying the proposed framework and the relatively accurate performance make this solution applicable to large-scale mutation data analyses.
- Subjects :
- Support Vector Machine
Mutation, Missense
Feature selection
Biology
Exponential growth
Neoplasms
Genetics
Animals
Humans
Missense mutation
Molecular Biology
Simulation
Sequence
Models, Genetic
business.industry
Pattern recognition
Genomics
Support vector machine
Computational Mathematics
Identification (information)
Computational Theory and Mathematics
Modeling and Simulation
Mutation (genetic algorithm)
Domain knowledge
Artificial intelligence
business
Subjects
Details
- ISSN :
- 15578666 and 10665277
- Volume :
- 22
- Database :
- OpenAIRE
- Journal :
- Journal of Computational Biology
- Accession number :
- edsair.doi.dedup.....a04d6f2a37c2b521efe5d2cc08cd9770