Back to Search
Start Over
Sparse Tensor Co-clustering as a Tool for Document Categorization
- Source :
- SIGIR
- Publication Year :
- 2019
- Publisher :
- ACM, 2019.
-
Abstract
- To deal with document clustering, we usually rely on document-term matrices. However, from additional available information like keywords, co-authors, citations we might rather exploit a reorganization of the data in the form of a tensor. In this paper, we extend the use of the Sparse Poisson Latent Block Model to deal with sparse tensor data using jointly all information arising from documents. The proposed model is parsimonious and tailored for this kind of data. To estimate the parameters, we derive a suitable tensor co-clustering algorithm. Empirical results on several real-world text datasets highlight the advantages of our proposal which improves the clustering results of documents.
Details
- Database :
- OpenAIRE
- Journal :
- Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
- Accession number :
- edsair.doi...........0c609a5a00cb0716a7bece4877d821a2
- Full Text :
- https://doi.org/10.1145/3331184.3331360