Back to Search Start Over

Leveraging 2D molecular graph pretraining for improved 3D conformer generation with graph neural networks.

Authors :
Alhamoud, Kumail
Ghunaim, Yasir
Alshehri, Abdulelah S.
Li, Guohao
Ghanem, Bernard
You, Fengqi
Source :
Computers & Chemical Engineering. Apr2024, Vol. 183, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

• Pretraining on abundant 2D molecular graphs to enhance 3D tasks is explored. • The limitations of 3D molecular conformer generation are addressed. • Enhancements are proposed by expanding and pretraining molecular embeddings. • Chemical information is used to anchor molecular embeddings in chemistry principles. • Our approach yields advancements across evaluation metrics, setting new benchmarks. Predicting stable 3D molecular conformations from 2D molecular graphs is a challenging and resource-intensive task, yet it is critical for various applications, particularly drug design. Density functional theory (DFT) calculations set the standard for molecular conformation generation, yet they are computationally intensive. Deep learning offers more computationally efficient approaches, but struggles to match DFT accuracy, particularly on complex drug-like structures. Additionally, the steep computational demands of assembling 3D molecular datasets constrain the broader adoption of deep learning. This work aims to utilize the abundant 2D molecular graph datasets for pretraining a machine learning model, a step that involves initially training the model on a different task with a wealth of data before fine-tuning it for the target task of 3D conformation generation. We build on GeoMol, an end-to-end graph neural network (GNN) method for predicting atomic 3D structures and torsion angles. We examine the limitations of the GeoMol method and introduce new baselines to enhance molecular graph embeddings. Our computational results show that 2D molecular graph pretraining enhances the quality of generated 3D conformers, yielding a 7.7 % average improvement over state-of-the-art sequential methods. These advancements not only facilitate superior 3D conformation generation but also emphasize the potential of leveraging pretrained graph embeddings to boost performance in 3D chemical tasks with GNNs. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00981354
Volume :
183
Database :
Academic Search Index
Journal :
Computers & Chemical Engineering
Publication Type :
Academic Journal
Accession number :
175569148
Full Text :
https://doi.org/10.1016/j.compchemeng.2024.108622