Back to Search
Start Over
Multi-task clustering through instances transfer
- Source :
- Neurocomputing. 251:145-155
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- We propose a multi-task clustering method by transferring knowledge of instances.The sample distance in different tasks is reweighted by learning a shared subspace.Related samples from other tasks are reused as auxiliary data to aid clustering.Our method maintains the label marginal distribution of each individual task.Better performance is observed compared with other multi-task clustering methods. Clustering is an essential issue in machine learning and data mining. As there are many related tasks in the real world, multi-task clustering, which improves the clustering performance of each task by transferring knowledge across the related tasks, receives increasing attention recently. Generally knowledge transfer can be accomplished in different ways. Nevertheless, besides transferring knowledge of feature representations, other knowledge transfer ways have seldom been adopted for multi-task clustering. In this paper, we propose a general multi-task clustering algorithm by transferring knowledge of instances. Our algorithm reweights the distance between samples in different tasks by learning a shared subspace, then selects the nearest neighbors for each sample from the other tasks in the learned shared subspace as the auxiliary data to aid the clustering process of each individual task. Experiments on real data sets in text mining and image mining demonstrate that our proposed algorithm outperforms the traditional single-task clustering methods and existing cross-domain multi-task clustering methods.
- Subjects :
- DBSCAN
Clustering high-dimensional data
Fuzzy clustering
Computer science
Cognitive Neuroscience
Correlation clustering
Conceptual clustering
Multi-task learning
02 engineering and technology
010501 environmental sciences
computer.software_genre
Machine learning
01 natural sciences
Biclustering
Text mining
Artificial Intelligence
CURE data clustering algorithm
Consensus clustering
0202 electrical engineering, electronic engineering, information engineering
Cluster analysis
0105 earth and related environmental sciences
Brown clustering
business.industry
Constrained clustering
Computer Science Applications
Hierarchical clustering
Data set
ComputingMethodologies_PATTERNRECOGNITION
Data stream clustering
Canopy clustering algorithm
Affinity propagation
FLAME clustering
020201 artificial intelligence & image processing
Data mining
Artificial intelligence
business
computer
Subspace topology
Subjects
Details
- ISSN :
- 09252312
- Volume :
- 251
- Database :
- OpenAIRE
- Journal :
- Neurocomputing
- Accession number :
- edsair.doi...........07519b0daf212ab06b598be8ee3bbcb1