Back to Search Start Over

Guided Semi-Supervised Non-Negative Matrix Factorization.

Authors :
Li, Pengyu
Tseng, Christine
Zheng, Yaxuan
Chew, Joyce A.
Huang, Longxiu
Jarman, Benjamin
Needell, Deanna
Source :
Algorithms. May2022, Vol. 15 Issue 5, p136. 18p.
Publication Year :
2022

Abstract

Classification and topic modeling are popular techniques in machine learning that extract information from large-scale datasets. By incorporating a priori information such as labels or important features, methods have been developed to perform classification and topic modeling tasks; however, most methods that can perform both do not allow for guidance of the topics or features. In this paper, we propose a novel method, namely Guided Semi-Supervised Non-negative Matrix Factorization (GSSNMF), that performs both classification and topic modeling by incorporating supervision from both pre-assigned document class labels and user-designed seed words. We test the performance of this method on legal documents provided by the California Innocence Project and the 20 Newsgroups dataset. Our results show that the proposed method improves both classification accuracy and topic coherence in comparison to past methods such as Semi-Supervised Non-negative Matrix Factorization (SSNMF), Guided Non-negative Matrix Factorization (Guided NMF), and Topic Supervised NMF. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
19994893
Volume :
15
Issue :
5
Database :
Academic Search Index
Journal :
Algorithms
Publication Type :
Academic Journal
Accession number :
157129914
Full Text :
https://doi.org/10.3390/a15050136