Back to Search
Start Over
Comparative Exploratory Analysis of Intrinsically Disordered Protein Dynamics Using Machine Learning and Network Analytic Methods
- Source :
- Frontiers in Molecular Biosciences, Vol 6 (2019), Frontiers in Molecular Biosciences
- Publication Year :
- 2019
- Publisher :
- Frontiers Media S.A., 2019.
-
Abstract
- Simulations of intrinsically disordered proteins (IDP) pose numerous challenges to comparative analysis, prominently including highly dynamic conformational states and a lack of well-defined secondary structure. Machine learning (ML) algorithms are especially effective at discriminating among high-dimensional inputs whose differences are extremely subtle, making them well suited to the study of IDPs. In this work, we apply various ML techniques, including support vector machines (SVM) and clustering, as well as related methods such as principal component analysis (PCA) and protein structure network (PSN) analysis, to the problem of uncovering differences between configurational data from molecular dynamics simulations of two variants of the same IDP. We examine molecular dynamics (MD) trajectories of wild-type amyloid beta (Abeta 1-40) and its ``Arctic'' variant (E22G), systems that play a central role in the etiology of Alzheimer's disease. Our analyses demonstrate ways in which ML and related approaches can be used to elucidate subtle differences between these proteins, including transient structure that is poorly captured by conventional metrics.
- Subjects :
- 0301 basic medicine
Computer science
protein structure networks
Machine learning
computer.software_genre
Intrinsically disordered proteins
Biochemistry, Genetics and Molecular Biology (miscellaneous)
Biochemistry
support vector machines
03 medical and health sciences
amyloid fibrils
0302 clinical medicine
Protein structure
Molecular Biosciences
Cluster analysis
Protein secondary structure
Molecular Biology
lcsh:QH301-705.5
Original Research
business.industry
Protein dynamics
Exploratory analysis
molecular dynamics
Support vector machine
amyloid beta
030104 developmental biology
machine learning
lcsh:Biology (General)
030220 oncology & carcinogenesis
Principal component analysis
Artificial intelligence
intrinsically disordered proteins
business
computer
clustering
Subjects
Details
- Language :
- English
- Volume :
- 6
- Database :
- OpenAIRE
- Journal :
- Frontiers in Molecular Biosciences
- Accession number :
- edsair.doi.dedup.....43623c583f2ded5f3498dd9b8963a345