Back to Search Start Over

Predicting combinative drug pairs via multiple classifier system with positive samples only.

Authors :
Shi, Jian-Yu
Li, Jia-Xin
Mao, Kui-Tao
Cao, Jiang-Bo
Lei, Peng
Lu, Hui-Meng
Yiu, Siu-Ming
Source :
Computer Methods & Programs in Biomedicine. Jan2019, Vol. 168, p1-10. 10p.
Publication Year :
2019

Abstract

Highlights • Five heterogeneous features are extracted to characterize drugs and drug pairs. • A two-layer MCS is designed to ensemble features for predicting drug combination. • MCS consists of one-class SVMs and is trained by only approved drug combinations. • Combining modes and targeting pathways of drugs in combination are investigated. Abstract Background and Objective Due to the synergistic effects of drugs, drug combination is one of the effective approaches for treating complex diseases. However, the identification of drug combinations by dose-response methods is still costly. It is promising to develop supervised learning-based approaches to predict potential drug combinations on a large scale. Nevertheless, these approaches have the inadequate utilization of heterogeneous features, which causes the loss of information useful to classification. Moreover, they have an intrinsic bias, because they assume unknown drug pairs as non-combinations, of which some could be real drug combinations in practice. Methods To address above issues, this work first designs a two-layer multiple classifier system (TLMCS) to effectively integrate heterogeneous features involving anatomical therapeutic chemical codes of drugs, drug-drug interactions, drug-target interactions, gene ontology of drug targets, and side effects. To avoid the bias caused by labelling unknown samples as negative, it then utilizes the one-class support vector machines, (which requires no negative instance and only labels approved drug combinations as positive instances), as the member classifiers in TLMCS. Last, both a 10-fold cross validation (10-CV) and a novel prediction are performed to validate the performance of TLMCS. Results The comparison with three state-of-the-art approaches under 10-CV exhibits the superiority of TLMCS, which achieves the area under the receiver operating characteristic curve = 0.824 and the area under the precision-recall curve = 0.372. Moreover, the experiment under the novel prediction demonstrates its ability, where 9 out of the top-20 predicted combinative drug pairs are validated by checking the published literature. Furthermore, for each of the newly-validated drug combinations, this work analyses the combining mode of the member drugs and investigates their relationship in terms of drug targeting pathways. Conclusions The proposed TLMCS provides an effective framework to integrate those heterogeneous features and is trained by only positive samples such that the bias of taking unknown drug pairs as negative samples can be avoided. Furthermore, its results in the novel prediction reveal five types of drug combinations and three types of drug relationships in terms of pathways. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01692607
Volume :
168
Database :
Academic Search Index
Journal :
Computer Methods & Programs in Biomedicine
Publication Type :
Academic Journal
Accession number :
133391163
Full Text :
https://doi.org/10.1016/j.cmpb.2018.11.002