1. Prospective Assessment of Virtual Screening Heuristics Derived Using a Novel Fusion Score
- Author
-
Michelle F. Homsher, Louis Locco, Shannon L. Stahler, Michael Weber, Jennifer E. Nothstein, Eleftheria N. Finger, Evelyn Boots, Mee Ra Heo, Amita Patel, Gregory O'Donnell, Alex Wolicki, Michael F.A. Finley, J. Christopher Culberson, Paul Zuck, Juncai Meng, Kenneth Roberts, David J. Bell, Peter S. Kutchukian, Gregory C. Adam, Carissa Quinn, Patrick Cocchiarella, Meir Glick, S. Alex May, Victor N. Uebele, Edward Hudak, Brian Squadroni, Daniel Riley, Andrew Rusinko, Kelli Solly, Michelle Hartnett, Dante A. Pertusi, Adam Amoss, Anthony Kreamer, Tara White, and Anne Mai Wassermann
- Subjects
0301 basic medicine ,Prioritization ,Engineering ,Drug Evaluation, Preclinical ,computer.software_genre ,01 natural sciences ,Biochemistry ,Analytical Chemistry ,Machine Learning ,User-Computer Interface ,03 medical and health sciences ,Heuristics ,Iterative and incremental development ,Virtual screening ,Drug discovery ,business.industry ,0104 chemical sciences ,Weighting ,010404 medicinal & biomolecular chemistry ,030104 developmental biology ,Cheminformatics ,Benchmark (computing) ,Molecular Medicine ,Data mining ,business ,computer ,Biotechnology - Abstract
High-throughput screening (HTS) is a widespread method in early drug discovery for identifying promising chemical matter that modulates a target or phenotype of interest. Because HTS campaigns involve screening millions of compounds, it is often desirable to initiate screening with a subset of the full collection. Subsequently, virtual screening methods prioritize likely active compounds in the remaining collection in an iterative process. With this approach, orthogonal virtual screening methods are often applied, necessitating the prioritization of hits from different approaches. Here, we introduce a novel method of fusing these prioritizations and benchmark it prospectively on 17 screening campaigns using virtual screening methods in three descriptor spaces. We found that the fusion approach retrieves 15% to 65% more active chemical series than any single machine-learning method and that appropriately weighting contributions of similarity and machine-learning scoring techniques can increase enrichment by 1% to 19%. We also use fusion scoring to evaluate the tradeoff between screening more chemical matter initially in lieu of replicate samples to prevent false-positives and find that the former option leads to the retrieval of more active chemical series. These results represent guidelines that can increase the rate of identification of promising active compounds in future iterative screens.
- Published
- 2017
- Full Text
- View/download PDF