Back to Search Start Over

Prediction of amyloid aggregation rates by machine learning and feature selection.

Authors :
Yang, Wuyue
Tan, Pengzhen
Fu, Xianjun
Hong, Liu
Source :
Journal of Chemical Physics; 8/28/2019, Vol. 151 Issue 8, pN.PAG-N.PAG, 9p, 1 Diagram, 2 Charts, 3 Graphs
Publication Year :
2019

Abstract

A novel data-based machine learning algorithm for predicting amyloid aggregation rates is reported in this paper. Based on a highly nonlinear projection from 16 intrinsic features of a protein and 4 extrinsic features of the environment to the protein aggregation rate, a feedforward fully connected neural network (FCN) with one hidden layer is trained on a dataset composed of 21 different kinds of amyloid proteins and tested on 4 rest proteins. FCN shows a much better performance than traditional algorithms, such as multivariable linear regression and support vector regression, with an average accuracy higher than 90%. Furthermore, by the correlation analysis and the principal component analysis, seven key features, folding energy, HP patterns for helix, sheet and helices cross membrane, pH, ionic strength, and protein concentration, are shown to constitute a minimum feature set for characterizing the amyloid aggregation kinetics. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00219606
Volume :
151
Issue :
8
Database :
Complementary Index
Journal :
Journal of Chemical Physics
Publication Type :
Academic Journal
Accession number :
138370782
Full Text :
https://doi.org/10.1063/1.5113848