Back to Search Start Over

An Atomistic Statistically Effective Energy Function for Computational Protein Design

Authors :
Isabelle André
Christopher M. Topham
Sophie Barbe
Laboratoire d'Ingénierie des Systèmes Biologiques et des Procédés (LISBP)
Centre National de la Recherche Scientifique (CNRS)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse)
Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Institut National de la Recherche Agronomique (INRA)
We gratefully acknowledge access granted to the HPC resources of the Midi-Pyrenees Regional Computing Centre (CALMIP, Toulouse, France).
ANR-12-MONU-0015,ProtiCAD,Modèles multi-physiques et algorithmes robotiques pour la conception assistée par ordinateur de protéines(2012)
French National Research Agency (ANR Project PROTICAD) [ANR-12-MONU-0015-03] Funding Text : This work was supported by the French National Research Agency (ANR Project PROTICAD, ANR-12-MONU-0015-03). We gratefully acknowledge access granted to the HPC resources of the Midi-Pyrenees Regional Computing Centre (CALMIP, Toulouse, France).
Institut National de la Recherche Agronomique (INRA)-Institut National des Sciences Appliquées - Toulouse (INSA Toulouse)
Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Institut National des Sciences Appliquées (INSA)-Université de Toulouse (UT)-Centre National de la Recherche Scientifique (CNRS)
Source :
Journal of Chemical Theory and Computation, Journal of Chemical Theory and Computation, American Chemical Society, 2016, 12 (8), pp.4146-4168. ⟨10.1021/acs.jctc.6b00090⟩, Journal of Chemical Theory and Computation, 2016, 12 (8), pp.4146-4168. ⟨10.1021/acs.jctc.6b00090⟩
Publication Year :
2016
Publisher :
American Chemical Society (ACS), 2016.

Abstract

Shortcomings in the definition of effective free-energy surfaces of proteins are recognized to be a major contributory factor responsible for the low success rates of existing automated methods for computational protein design (CPD). The formulation of an atomistic statistically effective energy function (SEEF) suitable for a wide range of CPD applications and its derivation from structural data extracted from protein domains and protein-ligand complexes are described here. The proposed energy function comprises nonlocal atom-based and local residue based SEEFs, which are coupled using a novel atom connectivity number factor to scale short-range, pairwise, nonbonded atomic interaction energies and a surface-area-dependent cavity energy term. This energy function was used to derive additional SEEFs describing the unfolded-state ensemble of any given residue sequence based on computed average energies for partially or fully solvent-exposed fragments in regions of irregular structure in native proteins. Relative thermal stabilities of 97 T4 bacteriophage lysozyme mutants were predicted from calculated energy differences for folded and unfolded states with an average unsigned error (AUE) of 0.84 kcal mol(-1) when compared to experiment. To demonstrate the utility of the energy function for CPD, further validation was carried out in tests of its capacity to recover cognate protein sequences and to discriminate native and near-native protein folds, loop conformers, and small-molecule ligand binding poses from non-native benchmark decoys. Experimental ligand binding free energies for a diverse set of 80 protein complexes could be predicted with an AUE of 2.4 kcal mol(-1) using an additional energy term to account for the loss in ligand configurational entropy upon binding. The atomistic SEEF is expected to improve the accuracy of residue-based coarse-grained SEEFs currently used in CPD and to extend the range of applications of extant atom-based protein statistical potentials.

Details

ISSN :
15499626 and 15499618
Volume :
12
Database :
OpenAIRE
Journal :
Journal of Chemical Theory and Computation
Accession number :
edsair.doi.dedup.....0847a4f54ab842398b28bdd25356b4e7