1. Redefining the Protein–Protein Interface: Coarse Graining and Combinatorics for an Improved Understanding of Amino Acid Contributions to the Protein–Protein Binding Affinity
- Author
-
Shaoyi Jiang, Jim Pfaendtner, and Josh K. Smith
- Subjects
Models, Molecular ,0301 basic medicine ,chemistry.chemical_classification ,Interface (Java) ,Chemistry ,Protein protein ,Proteins ,Surfaces and Interfaces ,Computational biology ,Condensed Matter Physics ,Amino acid ,03 medical and health sciences ,030104 developmental biology ,Lasso regression ,Biochemistry ,Electrochemistry ,Feature (machine learning) ,General Materials Science ,Amino Acid Sequence ,Granularity ,Amino Acids ,Spectroscopy ,Protein Binding ,Interpretability - Abstract
The ability to intervene in biological pathways has for decades been limited by the lack of a quantitative description of protein–protein interactions (PPIs). Herein we generate and compare millions of simple PPI models for insight into the mechanisms of specific recognition and binding. We use a coarse-grained approach whereby amino acids are counted in the interface, and these counts are used as binding affinity predictors. We perform lasso regression, a modern regression technique aimed at interpretability, with every possible amino acid combination (over 106 unique feature sets) to select only those amino acid predictors that provide more information than noise. This approach circumvents arbitrary binning and assumptions about the binding environment that obscure other binding affinity models. Aggregated analysis of these models trained at various interfacial cutoff distances informs the roles of specific amino acids in different binding contexts. We find that a simple amino acid count model outperfor...
- Published
- 2017
- Full Text
- View/download PDF