Back to Search
Start Over
Multi-task learning with a natural metric for quantitative structure activity relationship learning
- Source :
- Journal of Cheminformatics, Journal of Cheminformatics, 11(1):68. BioMed Central, Journal of Cheminformatics, Vol 11, Iss 1, Pp 1-13 (2019), Journal of Cheminformatics, 11, 68
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- © The Author(s) 2019. The goal of quantitative structure activity relationship (QSAR) learning is to learn a function that, given the structure of a small molecule (a potential drug), outputs the predicted activity of the compound. We employed multi-task learning (MTL) to exploit commonalities in drug targets and assays. We used datasets containing curated records about the activity of specific compounds on drug targets provided by ChEMBL. Totally, 1091 assays have been analysed. As a baseline, a single task learning approach that trains random forest to predict drug activity for each drug target individually was considered. We then carried out feature-based and instance-based MTL to predict drug activities. We introduced a natural metric of evolutionary distance between drug targets as a measure of tasks relatedness. Instance-based MTL significantly outperformed both, feature-based MTL and the base learner, on 741 drug targets out of 1091. Feature-based MTL won on 179 occasions and the base learner performed best on 171 drug targets. We conclude that MTL QSAR is improved by incorporating the evolutionary distance between targets. These results indicate that QSAR learning can be performed effectively, even if little data is available for specific drug targets, by leveraging what is known about similar drug targets. This research was funded by the Engineering and Physical Sciences Research Council (EPSRC) grant EP/K030469/1. NS would like to thank the EU PhenoM-eNal project (Horizon 2020, 654241)
- Subjects :
- Quantitative structure–activity relationship
Computer science
media_common.quotation_subject
education
multi-task learning
Multi-task learning
Sequence-based similarity
quantitative structure activity relationship
02 engineering and technology
Library and Information Sciences
Machine learning
computer.software_genre
sequence-based similarity
lcsh:Chemistry
03 medical and health sciences
0202 electrical engineering, electronic engineering, information engineering
Feature (machine learning)
Physical and Theoretical Chemistry
QA
Function (engineering)
030304 developmental biology
media_common
0303 health sciences
lcsh:T58.5-58.64
lcsh:Information technology
business.industry
chEMBL
Computer Graphics and Computer-Aided Design
Computer Science Applications
Random forest
Drug activity
lcsh:QD1-999
Quantitative structure activity relationship
Metric (mathematics)
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
random forest
psychological phenomena and processes
Research Article
Subjects
Details
- ISSN :
- 17582946
- Database :
- OpenAIRE
- Journal :
- Journal of Cheminformatics, Journal of Cheminformatics, 11(1):68. BioMed Central, Journal of Cheminformatics, Vol 11, Iss 1, Pp 1-13 (2019), Journal of Cheminformatics, 11, 68
- Accession number :
- edsair.doi.dedup.....284b488905d030ea265df05dc9d34242
- Full Text :
- https://doi.org/10.17863/cam.54817