Back to Search
Start Over
Regression applied to protein binding site prediction and comparison with classification
- Source :
- BMC Bioinformatics, Vol 10, Iss 1, p 276 (2009), BMC bioinformatics, Vol. 10, p. 276 (2009), BMC Bioinformatics
- Publication Year :
- 2009
- Publisher :
- BMC, 2009.
-
Abstract
- Background The structural genomics centers provide hundreds of protein structures of unknown function. Therefore, developing methods enabling the determination of a protein function automatically is imperative. The determination of a protein function can be achieved by studying the network of its physical interactions. In this context, identifying a potential binding site between proteins is of primary interest. In the literature, methods for predicting a potential binding site location generally are based on classification tools. The aim of this paper is to show that regression tools are more efficient than classification tools for patches based binding site predictors. For this purpose, we developed a patches based binding site localization method usable with either regression or classification tools. Results We compared predictive performances of regression tools with performances of machine learning classifiers. Using leave-one-out cross-validation, we showed that regression tools provide better predictions than classification ones. Among regression tools, Multilayer Perceptron ranked highest in the quality of predictions. We compared also the predictive performance of our patches based method using Multilayer Perceptron with the performance of three other methods usable through a web server. Our method performed similarly to the other methods. Conclusion Regression is more efficient than classification when applied to our binding site localization method. When it is possible, using regression instead of classification for other existing binding site predictors will probably improve results. Furthermore, the method presented in this work is flexible because the size of the predicted binding site is adjustable. This adaptability is useful when either false positive or negative rates have to be limited.
- Subjects :
- Web server
Computer science
Context (language use)
computer.software_genre
Machine learning
lcsh:Computer applications to medicine. Medical informatics
Biochemistry
Structural genomics
Protein structure
Structural Biology
Protein Interaction Mapping
Binding site
Databases, Protein
Molecular Biology
lcsh:QH301-705.5
Binding Sites
business.industry
Applied Mathematics
A protein
Proteins
Computational Biology
Regression analysis
Function (mathematics)
Classification
Regression
Computer Science Applications
ComputingMethodologies_PATTERNRECOGNITION
lcsh:Biology (General)
Multilayer perceptron
protein interaction mesh surface regression
Regression Analysis
lcsh:R858-859.7
Artificial intelligence
DNA microarray
business
computer
Algorithms
Research Article
Subjects
Details
- Language :
- English
- ISSN :
- 14712105
- Volume :
- 10
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....939583ac6ec8125b8ab77fc744ab508b