1. Practical Applications of Deep Learning To Impute Heterogeneous Drug Discovery Data.
- Author
-
Irwin BWJ, Levell JR, Whitehead TM, Segall MD, and Conduit GJ
- Subjects
- Chemistry, Pharmaceutical, Quantitative Structure-Activity Relationship, Deep Learning, Drug Discovery
- Abstract
Contemporary deep learning approaches still struggle to bring a useful improvement in the field of drug discovery because of the challenges of sparse, noisy, and heterogeneous data that are typically encountered in this context. We use a state-of-the-art deep learning method, Alchemite, to impute data from drug discovery projects, including multitarget biochemical activities, phenotypic activities in cell-based assays, and a variety of absorption, distribution, metabolism, and excretion (ADME) endpoints. The resulting model gives excellent predictions for activity and ADME endpoints, offering an average increase in R
2 of 0.22 versus quantitative structure-activity relationship methods. The model accuracy is robust to combining data across uncorrelated endpoints and projects with different chemical spaces, enabling a single model to be trained for all compounds and endpoints. We demonstrate improvements in accuracy on the latest chemistry and data when updating models with new data as an ongoing medicinal chemistry project progresses.- Published
- 2020
- Full Text
- View/download PDF