Back to Search Start Over

Sparse Tensor Co-clustering as a Tool for Document Categorization

Authors :
Rafika Boutalbi
Lazhar Labiod
Mohamed Nadif
Source :
SIGIR
Publication Year :
2019
Publisher :
ACM, 2019.

Abstract

To deal with document clustering, we usually rely on document-term matrices. However, from additional available information like keywords, co-authors, citations we might rather exploit a reorganization of the data in the form of a tensor. In this paper, we extend the use of the Sparse Poisson Latent Block Model to deal with sparse tensor data using jointly all information arising from documents. The proposed model is parsimonious and tailored for this kind of data. To estimate the parameters, we derive a suitable tensor co-clustering algorithm. Empirical results on several real-world text datasets highlight the advantages of our proposal which improves the clustering results of documents.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
Accession number :
edsair.doi...........0c609a5a00cb0716a7bece4877d821a2
Full Text :
https://doi.org/10.1145/3331184.3331360