Back to Search
Start Over
Analysis of deep learning methods for blind protein contact prediction in CASP12
- Publication Year :
- 2017
- Publisher :
- Cold Spring Harbor Laboratory, 2017.
-
Abstract
- Here we present the results of protein contact prediction achieved in CASP12 by our RaptorX-Contact server, which is an early implementation of our deep learning method for contact prediction. On a set of 38 free-modeling target domains with a median family size of around 58 effective sequences, our server obtained an average top L/5 long- and medium-range contact accuracy of 47% and 44%, respectively (L=length). A more advanced implementation has an average accuracy of 59% and 57%, respectively. Our deep learning method formulates contact prediction as an image pixel-level labeling problem and simultaneously predicts all residue pairs of a protein using a combination of two deep residual neural networks, taking as input the residue conservation information, predicted secondary structure and solvent accessibility, contact potential, and co-evolution information. Our approach differs from existing methods mainly in (1) formulating contact prediction as a pixel-level image labeling problem instead of an image-level classification problem; (2) simultaneously predicting all contacts of an individual protein to make effective use of contact occurrence patterns; and (3) integrating both 1D and 2D deep convolutional neural networks to effectively learn complex sequence-structure relationship including high-order residue correlation. This paper discusses the RaptorX-Contact pipeline, both contact prediction and contact-based folding results, and finally the strength and weakness of our method.
- Subjects :
- 0301 basic medicine
Models, Molecular
Protein Folding
Computer science
Protein Conformation
Residual
computer.software_genre
Crystallography, X-Ray
Biochemistry
Convolutional neural network
Article
Correlation
03 medical and health sciences
Software
0302 clinical medicine
Deep Learning
Structural Biology
Humans
CASP
Molecular Biology
Protein secondary structure
030304 developmental biology
0303 health sciences
Artificial neural network
business.industry
Deep learning
A protein
Computational Biology
Proteins
Pattern recognition
Solvent accessibility
Image labeling
030104 developmental biology
Data mining
Artificial intelligence
Neural Networks, Computer
business
computer
030217 neurology & neurosurgery
Algorithms
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....0a7f84f2e5a1ad288cf22624a2979644
- Full Text :
- https://doi.org/10.1101/181586