Back to Search
Start Over
Non-Negative Matrix Factorization for Semisupervised Heterogeneous Data Coclustering.
- Source :
-
IEEE Transactions on Knowledge & Data Engineering . Oct2010, Vol. 22 Issue 10, p1459-1474. 0p. 2 Diagrams, 8 Charts, 5 Graphs. - Publication Year :
- 2010
-
Abstract
- Coclustering heterogeneous data has attracted extensive attention recently due to its high impact on various important applications, such us text mining, image retrieval, and bioinformatics. However, data coclustering without any prior knowledge or background information is still a challenging problem. In this paper, we propose a Semisupervised Non-negative Matrix Factorization (SS-NMF) framework for data coclustering. Specifically, our method computes new relational matrices by incorporating user provided constraints through simultaneous distance metric learning and modality selection. Using an iterative algorithm, we then perform trifactorizations of the new matrices to infer the clusters of different data types and their correspondence. Theoretically, we prove the convergence and correctness of SS-NMF coclustering and show the relationship between SS-NMF with other well-known coclustering models. Through extensive experiments conducted on publicly available text, gene expression, and image data sets, we demonstrate the superior performance of SS-NMF for heterogeneous data coclustering. [ABSTRACT FROM AUTHOR]
- Subjects :
- *FACTORIZATION
*TEXT mining
*IMAGE retrieval
*BIOINFORMATICS
*ALGORITHMS
Subjects
Details
- Language :
- English
- ISSN :
- 10414347
- Volume :
- 22
- Issue :
- 10
- Database :
- Academic Search Index
- Journal :
- IEEE Transactions on Knowledge & Data Engineering
- Publication Type :
- Academic Journal
- Accession number :
- 53047702
- Full Text :
- https://doi.org/10.1109/TKDE.2009.169