1. Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19
- Author
-
Mathialakan Thavappiragasam, Gilchan Park, Kendall G. Byler, Leighton Coates, Laura Zanetti-Polzi, Jeffrey M. Larkin, Junqi Yin, John A. Gunnels, Omar Demerdash, Loukas Petridis, Ada Sedova, Carlos Soto, Aaron Scheinberg, Mai Zahran, Scott LeGrand, Jens Glaser, Jerome Baudry, Stephan Irle, Samuel Yen-Chi Chen, Andrey Kovalevsky, Isabella Daidone, Julie C. Mitchell, Arvind Ramanathan, Connor J. Cooper, Duncan Poole, V. Q. Vuong, Diogo Santos-Martins, David M. Rogers, Shinjae Yoo, Y. Shen, Oscar Hernandez, A. Tsaris, Swen Boehm, Debsindhu Bhowmik, Travis J Lawrence, Daniel W. Kneller, Shih-Hsien Liu, Jeremy C. Smith, Line Pouchard, Matthew B. Baker, Stefano Forli, Sally R. Ellingson, Anna Pavlova, Rupesh Agarwal, Micholas Dean Smith, Atanu Acharya, James C. Gumbart, Andreas F. Tillack, John D. Eblen, Josh V. Vermaas, and Jerry M. Parks
- Subjects
Enhanced sampling ,Computer science ,Protein Conformation ,General Chemical Engineering ,Drug Evaluation, Preclinical ,Viral Nonstructural Proteins ,Replica exchange ,01 natural sciences ,Molecular Docking Simulation ,Molecular dynamics ,010304 chemical physics ,Drug discovery ,AutoDock ,Supercomputer ,Supercomputers ,Preclinical ,Spike Glycoprotein ,Computer Science Applications ,Spike Glycoprotein, Coronavirus ,Coronavirus disease 2019 (COVID-19) ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Chemical ,Library and Information Sciences ,Antiviral Agents ,Article ,Autodock vina ,Computational science ,Databases ,Structure-Activity Relationship ,Artificial Intelligence ,0103 physical sciences ,Humans ,Computer Simulation ,Binding Sites ,SARS-CoV-2 ,COVID-19 ,Proteins ,General Chemistry ,0104 chemical sciences ,COVID-19 Drug Treatment ,Coronavirus ,010404 medicinal & biomolecular chemistry ,Massively parallel supercomputing ,Docking (molecular) ,Drug Design ,Drug Evaluation ,Databases, Chemical - Abstract
We present a supercomputer-driven pipeline for in silico drug discovery using enhanced sampling molecular dynamics (MD) and ensemble docking. Ensemble docking makes use of MD results by docking compound databases into representative protein binding-site conformations, thus taking into account the dynamic properties of the binding sites. We also describe preliminary results obtained for 24 systems involving eight proteins of the proteome of SARS-CoV-2. The MD involves temperature replica exchange enhanced sampling, making use of massively parallel supercomputing to quickly sample the configurational space of protein drug targets. Using the Summit supercomputer at the Oak Ridge National Laboratory, more than 1 ms of enhanced sampling MD can be generated per day. We have ensemble docked repurposing databases to 10 configurations of each of the 24 SARS-CoV-2 systems using AutoDock Vina. Comparison to experiment demonstrates remarkably high hit rates for the top scoring tranches of compounds identified by our ensemble approach. We also demonstrate that, using Autodock-GPU on Summit, it is possible to perform exhaustive docking of one billion compounds in under 24 h. Finally, we discuss preliminary results and planned improvements to the pipeline, including the use of quantum mechanical (QM), machine learning, and artificial intelligence (AI) methods to cluster MD trajectories and rescore docking poses.
- Published
- 2020
- Full Text
- View/download PDF