Back to Search
Start Over
Revisiting the 'satisfaction of spatial restraints' approach of MODELLER for protein homology modeling
- Source :
- PLoS computational biology 15 (2019). doi:10.1371/journal.pcbi.1007219, info:cnr-pdr/source/autori:Janson, Giacomo; Grottesi, Alessandro; Pietrosanto, Marco; Ausiello, Gabriele; Guarguaglini, Giulia; Paiardini, Alessandro/titolo:Revisiting the "satisfaction of spatial restraints" approach of MODELLER for protein homology modeling/doi:10.1371%2Fjournal.pcbi.1007219/rivista:PLoS computational biology/anno:2019/pagina_da:/pagina_a:/intervallo_pagine:/volume:15, PLoS Computational Biology, PLoS Computational Biology, Vol 15, Iss 12, p e1007219 (2019)
- Publication Year :
- 2019
- Publisher :
- Public Library of Science, San Francisco, CA , Stati Uniti d'America, 2019.
-
Abstract
- The most frequently used approach for protein structure prediction is currently homology modeling. The 3D model building phase of this methodology is critical for obtaining an accurate and biologically useful prediction. The most widely employed tool to perform this task is MODELLER. This program implements the “modeling by satisfaction of spatial restraints” strategy and its core algorithm has not been altered significantly since the early 1990s. In this work, we have explored the idea of modifying MODELLER with two effective, yet computationally light strategies to improve its 3D modeling performance. Firstly, we have investigated how the level of accuracy in the estimation of structural variability between a target protein and its templates in the form of σ values profoundly influences 3D modeling. We show that the σ values produced by MODELLER are on average weakly correlated to the true level of structural divergence between target-template pairs and that increasing this correlation greatly improves the program’s predictions, especially in multiple-template modeling. Secondly, we have inquired into how the incorporation of statistical potential terms (such as the DOPE potential) in the MODELLER’s objective function impacts positively 3D modeling quality by providing a small but consistent improvement in metrics such as GDT-HA and lDDT and a large increase in stereochemical quality. Python modules to harness this second strategy are freely available at https://github.com/pymodproject/altmod. In summary, we show that there is a large room for improving MODELLER in terms of 3D modeling quality and we propose strategies that could be pursued in order to further increase its performance.<br />Author summary Proteins are fundamental biological molecules that carry out countless activities in living beings. Since the function of proteins is dictated by their three-dimensional atomic structures, acquiring structural details of proteins provides deep insights into their function. Currently, the most frequently used computational approach for protein structure prediction is template-based modeling. In this approach, a target protein is modeled using the experimentally-derived structural information of a template protein assumed to have a similar structure to the target. MODELLER is the most frequently used program for template-based 3D model building. Despite its success, its predictions are not always accurate enough to be useful in Biomedical Research. Here, we show that it is possible to greatly increase the performance of MODELLER by modifying two aspects of its algorithm. First, we demonstrate that providing the program with accurate estimations of local target-template structural divergence greatly increases the quality of its predictions. Additionally, we show that modifying MODELLER’s scoring function with statistical potential energetic terms also helps to improve modeling quality. This work will be useful in future research, since it reports practical strategies to improve the performance of this core tool in Structural Bioinformatics.
- Subjects :
- 0301 basic medicine
Models, Molecular
Protein Structure Comparison
Computer science
Economics
Social Sciences
Protein Structure Prediction
computer.software_genre
Biochemistry
Machine Learning
Database and Informatics Methods
0302 clinical medicine
computational biology
Macromolecular Structure Analysis
Biology (General)
homology modelling
computer.programming_language
media_common
averaging
Ecology
Settore BIO/11
Applied Mathematics
Simulation and Modeling
MODELLER
Protein structure prediction
3D modeling
structure prediction
molecular dynamics simulation
Computational Theory and Mathematics
Modeling and Simulation
Physical Sciences
sequence alignment
Protein homology
Target protein
Statistical potential
Sequence Analysis
Algorithms
Research Article
Employment
Optimization
Protein Structure
Computer and Information Sciences
QH301-705.5
Bioinformatics
media_common.quotation_subject
Machine learning
Research and Analysis Methods
03 medical and health sciences
Cellular and Molecular Neuroscience
Machine Learning Algorithms
models
Artificial Intelligence
Genetics
Quality (business)
Homology modeling
proteins
models, molecular
software
structural homology, protein
molecular
Divergence (statistics)
Molecular Biology
Ecology, Evolution, Behavior and Systematics
structural homology
business.industry
Biology and Life Sciences
Python (programming language)
030104 developmental biology
Structural Homology, Protein
Labor Economics
Artificial intelligence
business
protein
computer
030217 neurology & neurosurgery
Mathematics
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- PLoS computational biology 15 (2019). doi:10.1371/journal.pcbi.1007219, info:cnr-pdr/source/autori:Janson, Giacomo; Grottesi, Alessandro; Pietrosanto, Marco; Ausiello, Gabriele; Guarguaglini, Giulia; Paiardini, Alessandro/titolo:Revisiting the "satisfaction of spatial restraints" approach of MODELLER for protein homology modeling/doi:10.1371%2Fjournal.pcbi.1007219/rivista:PLoS computational biology/anno:2019/pagina_da:/pagina_a:/intervallo_pagine:/volume:15, PLoS Computational Biology, PLoS Computational Biology, Vol 15, Iss 12, p e1007219 (2019)
- Accession number :
- edsair.doi.dedup.....2ac94007ce3c61232a24cc59c3095e60
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1007219