Back to Search
Start Over
Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins
- Source :
- Bioinformatics
- Publication Year :
- 2010
- Publisher :
- Oxford University Press, 2010.
-
Abstract
- Motivation: Protein–protein interactions (PPIs) are critical for virtually every biological function. Recently, researchers suggested to use supervised learning for the task of classifying pairs of proteins as interacting or not. However, its performance is largely restricted by the availability of truly interacting proteins (labeled). Meanwhile, there exists a considerable amount of protein pairs where an association appears between two partners, but not enough experimental evidence to support it as a direct interaction (partially labeled). Results: We propose a semi-supervised multi-task framework for predicting PPIs from not only labeled, but also partially labeled reference sets. The basic idea is to perform multi-task learning on a supervised classification task and a semi-supervised auxiliary task. The supervised classifier trains a multi-layer perceptron network for PPI predictions from labeled examples. The semi-supervised auxiliary task shares network layers of the supervised classifier and trains with partially labeled examples. Semi-supervision could be utilized in multiple ways. We tried three approaches in this article, (i) classification (to distinguish partial positives with negatives); (ii) ranking (to rate partial positive more likely than negatives); (iii) embedding (to make data clusters get similar labels). We applied this framework to improve the identification of interacting pairs between HIV-1 and human proteins. Our method improved upon the state-of-the-art method for this task indicating the benefits of semi-supervised multi-task learning using auxiliary information. Availability: http://www.cs.cmu.edu/∼qyj/HIVsemi Contact: qyj@cs.cmu.edu
- Subjects :
- Statistics and Probability
Computer science
Human Immunodeficiency Virus Proteins
Multi-task learning
02 engineering and technology
Semi-supervised learning
Machine learning
computer.software_genre
Biochemistry
Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
03 medical and health sciences
Genomic Medicine
Artificial Intelligence
Protein Interaction Mapping
0202 electrical engineering, electronic engineering, information engineering
Humans
Molecular Biology
Human proteins
030304 developmental biology
0303 health sciences
Models, Statistical
business.industry
Supervised learning
Computational Biology
Proteins
Pattern recognition
Perceptron
Original Papers
Computer Science Applications
Computational Mathematics
ComputingMethodologies_PATTERNRECOGNITION
Computational Theory and Mathematics
Ranking
Data Interpretation, Statistical
HIV-1
Embedding
020201 artificial intelligence & image processing
Artificial intelligence
business
computer
Classifier (UML)
Algorithms
Subjects
Details
- Language :
- English
- ISSN :
- 13674811 and 13674803
- Volume :
- 26
- Issue :
- 18
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....5864dd7e81d39152e12c8d44326ebae3