Back to Search
Start Over
PROTEIN FOLD CLASSIFICATION WITH GENETIC ALGORITHMS AND FEATURE SELECTION.
- Source :
-
Journal of Bioinformatics & Computational Biology . Oct2009, Vol. 7 Issue 5, p773-788. 16p. 4 Diagrams, 2 Charts, 4 Graphs. - Publication Year :
- 2009
-
Abstract
- Protein fold classification is a key step to predicting protein tertiary structures. This paper proposes a novel approach based on genetic algorithms and feature selection to classifying protein folds. Our dataset is divided into a training dataset and a test dataset. Each individual for the genetic algorithms represents a selection function of the feature vectors of the training dataset. A support vector machine is applied to each individual to evaluate the fitness value (fold classification rate) of each individual. The aim of the genetic algorithms is to search for the best individual that produces the highest fold classification rate. The best individual is then applied to the feature vectors of the test dataset and a support vector machine is built to classify protein folds based on selected features. Our experimental results on Ding and Dubchak's benchmark dataset of 27-class folds show that our approach achieves an accuracy of 71.28%, which outperforms current state-of-the-art protein fold predictors. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 02197200
- Volume :
- 7
- Issue :
- 5
- Database :
- Academic Search Index
- Journal :
- Journal of Bioinformatics & Computational Biology
- Publication Type :
- Academic Journal
- Accession number :
- 44340340
- Full Text :
- https://doi.org/10.1142/S0219720009004321