Back to Search
Start Over
Scores of amino acid 0D-3D information as applied in cleavage site prediction and better specificity elucidation for human immunodeficiency virus type 1 protease
- Source :
- Science in China Series B: Chemistry. 51:794-800
- Publication Year :
- 2008
- Publisher :
- Springer Science and Business Media LLC, 2008.
-
Abstract
- A new set of descriptors, namely score vectors of the zero dimension, one dimension, two dimensions and three dimensions (SZOTT), was derived from principle component analysis of a matrix of 1369 structural variables including 0D, 1D, 2D and 3D information for the 20 coded amino acids. SZOTT scales were then used in cleavage site prediction of human immunodeficiency virus type 1 protease. Linear discriminant analysis (LDA) and support vector machines (SVM) were applied to developing models to predict the cleavage sites. The results obtained by linear discriminant analysis (LDA) and support vector machines (SVM) are as follows. The Matthews correlation coefficients (MCC) by the resubstitution test, leave-one-out cross validation (LOOCV) and external validation are 0.879 and 0.911, 0.849 and 0.901, 0.822 and 0.846, respectively. The receiver operating characteristic (ROC) analysis showed that the SVM model possesses better simulative and predictive ability in comparison with the LDA model. Satisfactory results show that SZOTT descriptors can be further used to predict cleavage sites of human immunodeficiency virus type 1 protease.
- Subjects :
- Protease
Receiver operating characteristic
business.industry
medicine.medical_treatment
Pattern recognition
General Chemistry
Linear discriminant analysis
Cleavage (embryo)
Cross-validation
Support vector machine
Correlation
Principal component analysis
medicine
Artificial intelligence
business
Mathematics
Subjects
Details
- ISSN :
- 18622771 and 10069291
- Volume :
- 51
- Database :
- OpenAIRE
- Journal :
- Science in China Series B: Chemistry
- Accession number :
- edsair.doi...........9ad674c2c284f6ca708431626ea35cb9
- Full Text :
- https://doi.org/10.1007/s11426-008-0088-2