
A transfer learning approach via Procrustes analysis and mean shift for cancer drug sensitivity prediction.

Authors :
Turki, Turki
Wei, Zhi
Wang, Jason T. L.
Source :
Journal of Bioinformatics & Computational Biology; Jun 2018, Vol. 16 Issue 3, pN.PAG-N.PAG, 31p
Publication Year :
2018

Abstract

Transfer learning (TL) algorithms aim to improve prediction performance in a target task (e.g. the prediction of cisplatin sensitivity in triple-negative breast cancer patients) by transferring knowledge from auxiliary data of a related task (e.g. the prediction of docetaxel sensitivity in breast cancer patients), where the distribution and even the feature space of the data for the two tasks can differ. In real-world applications, we sometimes have only a limited training set for the target task but also have auxiliary data from a related task. Supervised learning, however, requires a sufficiently large target training set to predict future test examples of the target task well. In this paper, we propose a TL approach for cancer drug sensitivity prediction that combines three techniques. First, we shift the representation of a subset of examples from the auxiliary data of a related task to a representation closer to the target training set. Second, we align the shifted representation of the selected auxiliary examples to the target training set, obtaining examples whose representation is aligned to the target training set. Third, we train machine learning algorithms using both the target training set and the aligned examples. We evaluate our approach against baseline approaches using the area under the receiver operating characteristic (ROC) curve (AUC) on real clinical trial datasets pertaining to multiple myeloma, non-small cell lung cancer, triple-negative breast cancer, and breast cancer. Experimental results show that our approach outperforms the baseline approaches, with statistically significant improvements. [ABSTRACT FROM AUTHOR]
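The three steps named in the abstract (shift, align, train on the combined data) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the algorithmic details, so the sketch assumes that both tasks share the same feature dimension, that "mean shift" means moving each auxiliary example toward a Gaussian-kernel-weighted mean of the target examples, that the alignment is an orthogonal Procrustes fit between equally sized matrices, and that the auxiliary subset is chosen at random. The function names, the `bandwidth` and `n_iter` parameters, and the synthetic data are all hypothetical.

```python
import numpy as np

def mean_shift_toward(aux, target, bandwidth=1.0, n_iter=5):
    """Assumed shift step: move each auxiliary row toward the
    Gaussian-kernel-weighted mean of the target rows."""
    shifted = aux.astype(float).copy()
    for _ in range(n_iter):
        # Squared distance between every shifted row and every target row.
        d2 = ((shifted[:, None, :] - target[None, :, :]) ** 2).sum(axis=2)
        # Subtract the row-wise minimum so the largest weight is exactly 1.
        w = np.exp(-(d2 - d2.min(axis=1, keepdims=True)) / (2 * bandwidth ** 2))
        shifted = (w @ target) / w.sum(axis=1, keepdims=True)
    return shifted

def procrustes_align(source, target):
    """Orthogonal Procrustes: rotate/reflect the centered source onto the
    centered target. Both matrices must have the same shape."""
    mu_s, mu_t = source.mean(axis=0), target.mean(axis=0)
    u, _, vt = np.linalg.svd((source - mu_s).T @ (target - mu_t))
    rotation = u @ vt  # minimizes ||(source - mu_s) @ R - (target - mu_t)||_F
    return (source - mu_s) @ rotation + mu_t

def build_augmented_training_set(x_aux, y_aux, x_tgt, y_tgt, seed=0):
    """Select, shift, and align auxiliary rows, then stack them with the target set."""
    rng = np.random.default_rng(seed)
    # Assumption: pick as many auxiliary rows as there are target rows,
    # because this Procrustes variant needs equally sized matrices.
    idx = rng.choice(len(x_aux), size=len(x_tgt), replace=False)
    shifted = mean_shift_toward(x_aux[idx], x_tgt)
    aligned = procrustes_align(shifted, x_tgt)
    return np.vstack([x_tgt, aligned]), np.concatenate([y_tgt, y_aux[idx]])

# Synthetic stand-in data (not real clinical trial data).
rng = np.random.default_rng(0)
x_tgt, y_tgt = rng.normal(size=(8, 4)), rng.integers(0, 2, size=8)
x_aux, y_aux = rng.normal(loc=2.0, size=(30, 4)), rng.integers(0, 2, size=30)
x_train, y_train = build_augmented_training_set(x_aux, y_aux, x_tgt, y_tgt)
print(x_train.shape, y_train.shape)  # (16, 4) (16,)
```

The augmented pair `x_train`, `y_train` could then be passed to any supervised classifier and scored with AUC on held-out target examples. The sketch requires at least as many auxiliary rows as target rows, and the row pairing used by the Procrustes fit is arbitrary here, which is a simplification.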

Details

Language :
English
ISSN :
02197200
Volume :
16
Issue :
3
Database :
Complementary Index
Journal :
Journal of Bioinformatics & Computational Biology
Publication Type :
Academic Journal
Accession number :
130376176
Full Text :
https://doi.org/10.1142/S0219720018400140