Back to Search Start Over

3DDPDs: describing protein dynamics for proteochemometric bioactivity prediction. A case for (mutant) G protein-coupled receptors

Authors :
Marina Gorostiola González
Remco L. van den Broek
Thomas G. M. Braun
Magdalini Chatzopoulou
Willem Jespers
Adriaan P. IJzerman
Laura H. Heitman
Gerard J. P. van Westen
Source :
Journal of Cheminformatics, Vol 15, Iss 1, Pp 1-19 (2023)
Publication Year :
2023
Publisher :
BMC, 2023.

Abstract

Abstract Proteochemometric (PCM) modelling is a powerful computational drug discovery tool used in bioactivity prediction of potential drug candidates relying on both chemical and protein information. In PCM features are computed to describe small molecules and proteins, which directly impact the quality of the predictive models. State-of-the-art protein descriptors, however, are calculated from the protein sequence and neglect the dynamic nature of proteins. This dynamic nature can be computationally simulated with molecular dynamics (MD). Here, novel 3D dynamic protein descriptors (3DDPDs) were designed to be applied in bioactivity prediction tasks with PCM models. As a test case, publicly available G protein-coupled receptor (GPCR) MD data from GPCRmd was used. GPCRs are membrane-bound proteins, which are activated by hormones and neurotransmitters, and constitute an important target family for drug discovery. GPCRs exist in different conformational states that allow the transmission of diverse signals and that can be modified by ligand interactions, among other factors. To translate the MD-encoded protein dynamics two types of 3DDPDs were considered: one-hot encoded residue-specific (rs) and embedding-like protein-specific (ps) 3DDPDs. The descriptors were developed by calculating distributions of trajectory coordinates and partial charges, applying dimensionality reduction, and subsequently condensing them into vectors per residue or protein, respectively. 3DDPDs were benchmarked on several PCM tasks against state-of-the-art non-dynamic protein descriptors. Our rs- and ps3DDPDs outperformed non-dynamic descriptors in regression tasks using a temporal split and showed comparable performance with a random split and in all classification tasks. Combinations of non-dynamic descriptors with 3DDPDs did not result in increased performance. Finally, the power of 3DDPDs to capture dynamic fluctuations in mutant GPCRs was explored. The results presented here show the potential of including protein dynamic information on machine learning tasks, specifically bioactivity prediction, and open opportunities for applications in drug discovery, including oncology.

Details

Language :
English
ISSN :
17582946
Volume :
15
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Journal of Cheminformatics
Publication Type :
Academic Journal
Accession number :
edsdoj.1a3bbdb4d14241b5a95862590f309cd3
Document Type :
article
Full Text :
https://doi.org/10.1186/s13321-023-00745-5