Back to Search
Start Over
Hypergraph-based persistent cohomology (HPC) for molecular representations in drug design
- Source :
- Briefings in bioinformatics. 22(5)
- Publication Year :
- 2020
-
Abstract
- Artificial intelligence (AI) based drug design has demonstrated great potential to fundamentally change the pharmaceutical industries. Currently, a key issue in AI-based drug design is efficient transferable molecular descriptors or fingerprints. Here, we present hypergraph-based molecular topological representation, hypergraph-based (weighted) persistent cohomology (HPC/HWPC) and HPC/HWPC-based molecular fingerprints for machine learning models in drug design. Molecular structures and their atomic interactions are highly complicated and pose great challenges for efficient mathematical representations. We develop the first hypergraph-based topological framework to characterize detailed molecular structures and interactions at atomic level. Inspired by the elegant path complex model, hypergraph-based embedded homology and persistent homology have been proposed recently. Based on them, we construct HPC/HWPC, and use them to generate molecular descriptors for learning models in protein-ligand binding affinity prediction, one of the key step in drug design. Our models are tested on three most commonly-used databases, including PDBbind-v2007, PDBbind-v2013 and PDBbind-v2016, and outperform all existing machine learning models with traditional molecular descriptors. Our HPC/HWPC models have demonstrated great potential in AI-based drug design. Ministry of Education (MOE) Nanyang Technological University This work was supported in part by Nanyang Technological University Startup Grant M4081842 and Singapore Ministry of Education Academic Research fund Tier 1 RG31/18 and RG109/19, Tier 2 MOE2018-T2-1-033. The second author was supported by Natural Science Foundation of China (NSFC grant no. 11871284). The third author was supported by Natural Science Foundation of China (NSFC grant no. 11971144) and High-level Scientific Research Foundation of Hebei Province.
- Subjects :
- Mathematics [Science]
Molecular Descriptor
0303 health sciences
Hypergraph
Theoretical computer science
Persistent homology
Computer science
010102 general mathematics
Construct (python library)
Homology (mathematics)
01 natural sciences
Cohomology
Machine Learning
03 medical and health sciences
Models, Chemical
Molecular descriptor
Drug Design
Path (graph theory)
0101 mathematics
Representation (mathematics)
Databases, Protein
Molecular Biology
030304 developmental biology
Information Systems
Subjects
Details
- ISSN :
- 14774054
- Volume :
- 22
- Issue :
- 5
- Database :
- OpenAIRE
- Journal :
- Briefings in bioinformatics
- Accession number :
- edsair.doi.dedup.....000deb19aa3d13240782025729a7bf5a