1. Chemical structure-aware molecular image representation learning.
- Author
- Xiang, Hongxin; Jin, Shuting; Liu, Xiangrong; Zeng, Xiangxiang; Zeng, Li
- Subjects
- *IMAGE representation, *DRUG discovery, *KNOWLEDGE graphs, *CHEMICAL structure, *MOLECULAR graphs, *DOUBLE bonds, *KNOWLEDGE transfer
- Abstract
- Current methods of molecular image-based drug discovery face two major challenges: (1) working effectively in the absence of labels, and (2) capturing chemical structure from images, where it is only implicitly encoded. Given that molecular graphs explicitly encode chemical structures (such as nitrogen, benzene rings and double bonds), we leverage self-supervised contrastive learning to transfer chemical knowledge from graphs to images. Specifically, we propose a novel Contrastive Graph-Image Pre-training (CGIP) framework for molecular representation learning, which learns explicit information in graphs and implicit information in images from large-scale unlabeled molecules via carefully designed intra- and inter-modal contrastive learning. We evaluate CGIP in multiple experimental settings (molecular property prediction, cross-modal retrieval and distribution similarity). The results show that CGIP achieves state-of-the-art performance on all 12 benchmark datasets and that it transfers chemical knowledge from graphs to molecular images, enabling the image encoder to perceive chemical structures in images. We hope this simple and effective framework will inspire people to think about the value of images for molecular representation learning. [ABSTRACT FROM AUTHOR]
- Published
- 2023
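
The inter-modal contrastive learning mentioned in the abstract can be illustrated with a minimal sketch. This is not the CGIP loss itself, which the abstract does not specify; it is a generic symmetric InfoNCE-style objective, assuming each molecule already has one graph embedding and one image embedding. The function names, the `temperature` value, and the random embeddings are all hypothetical and chosen only for illustration.

```python
import math
import random


def normalize(vec):
    """Scale a vector to unit length (left unchanged if its norm is zero)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def cross_entropy_diagonal(logits):
    """Mean cross-entropy where the correct class for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the row max for numerical stability
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[i]
    return total / len(logits)


def inter_modal_loss(graph_emb, image_emb, temperature=0.1):
    """Symmetric graph-to-image and image-to-graph contrastive loss.

    Assumption: row i of both inputs describes the same molecule, so the
    matching pairs sit on the diagonal of the similarity matrix.
    """
    g = [normalize(v) for v in graph_emb]
    im = [normalize(v) for v in image_emb]
    # Cosine similarity of every graph with every image, scaled by temperature.
    logits = [[sum(a * b for a, b in zip(gi, vj)) / temperature for vj in im]
              for gi in g]
    transposed = [list(col) for col in zip(*logits)]
    return 0.5 * (cross_entropy_diagonal(logits)
                  + cross_entropy_diagonal(transposed))


# Toy data standing in for encoder outputs (purely synthetic).
random.seed(0)
graph_emb = [[random.gauss(0, 1) for _ in range(16)] for _ in range(8)]
aligned_img = [[x + random.gauss(0, 0.01) for x in row] for row in graph_emb]
random_img = [[random.gauss(0, 1) for _ in range(16)] for _ in range(8)]

aligned_loss = inter_modal_loss(graph_emb, aligned_img)
random_loss = inter_modal_loss(graph_emb, random_img)
print("aligned:", aligned_loss, "unrelated:", random_loss)
```

In this toy setup the loss is lower when image embeddings track their paired graph embeddings than when they are unrelated, which is the pressure that would push an image encoder toward the structure captured by the graph encoder. An intra-modal term (contrasting augmented views within one modality) could reuse the same function with two views of the same modality as inputs.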