1. Experimental determination and data-driven prediction of homotypic transmembrane domain interfaces
- Author
-
Dieter Langosch, Yao Xiao, Dmitrij Frishman, Nicola Berner, Bo Zeng, and Mark G. Teese
- Subjects
lcsh:Biotechnology ,Biophysics ,Computational biology ,Biochemistry ,Interactome ,Data-driven ,Protein–protein interaction ,03 medical and health sciences ,Protein-protein interaction ,0302 clinical medicine ,Data sequences ,TMD interactions ,Structural Biology ,lcsh:TP248.13-248.65 ,Machine learning ,Genetics ,030304 developmental biology ,Physics ,0303 health sciences ,GxxxG ,Protein primary structure ,Transmembrane ,Transmembrane protein ,ddc ,Computer Science Applications ,Co-evolution ,Transmembrane domain ,Membrane protein ,030220 oncology & carcinogenesis ,Research Article ,Biotechnology - Abstract
Highlights • Homotypic TMD interfaces identified by different techniques share strong similarities. • The GxxxG motif is the feature most strongly associated with interfaces. • Other features include conservation, polarity, coevolution, and depth in the membrane • The role of each of each feature strongly depends on the individual protein. • Machine-learning helps predict interfaces from evolutionary sequence data, Interactions between their transmembrane domains (TMDs) frequently support the assembly of single-pass membrane proteins to non-covalent complexes. Yet, the TMD-TMD interactome remains largely uncharted. With a view to predicting homotypic TMD-TMD interfaces from primary structure, we performed a systematic analysis of their physical and evolutionary properties. To this end, we generated a dataset of 50 self-interacting TMDs. This dataset contains interfaces of nine TMDs from bitopic human proteins (Ire1, Armcx6, Tie1, ATP1B1, PTPRO, PTPRU, PTPRG, DDR1, and Siglec7) that were experimentally identified here and combined with literature data. We show that interfacial residues of these homotypic TMD-TMD interfaces tend to be more conserved, coevolved and polar than non-interfacial residues. Further, we suggest for the first time that interface positions are deficient in β-branched residues, and likely to be located deep in the hydrophobic core of the membrane. Overrepresentation of the GxxxG motif at interfaces is strong, but that of (small)xxx(small) motifs is weak. The multiplicity of these features and the individual character of TMD-TMD interfaces, as uncovered here, prompted us to train a machine learning algorithm. The resulting prediction method, THOIPA (www.thoipa.org), excels in the prediction of key interface residues from evolutionary sequence data.
- Published
- 2020
- Full Text
- View/download PDF