Multi-label text classification with an ensemble feature space.

Authors :
Tandon, Kushagri
Chatterjee, Niladri
Pinto, David
Beltrán, Beatriz
Singh, Vivek
Source :
Journal of Intelligent & Fuzzy Systems; 2022, Vol. 42 Issue 5, p4425-4436, 12p
Publication Year :
2022

Abstract

Multi-label text classification assigns more than one class to a given text document, which makes the task both more ambiguous and more challenging. The ambiguity arises because several labels in the prescribed label set are often semantically close to each other, making clear demarcation between them difficult. As a consequence, any machine-learning-based approach to developing a multi-label classification scheme needs to define its feature space by choosing features beyond linguistic or semi-linguistic ones, so that the semantic closeness between the labels is also taken into account. The present work describes a feature extraction scheme in which the training document set and the prescribed label set are intertwined in a novel way to capture this ambiguity. In particular, experiments were conducted using Topic Modeling and Fuzzy C-Means clustering, which aim at measuring the underlying uncertainty using probability-based and membership-based measures, respectively. Several nonparametric hypothesis tests establish the effectiveness of the features obtained through Fuzzy C-Means clustering in multi-label classification. A new algorithm is proposed for training the system for multi-label classification using the above set of features. [ABSTRACT FROM AUTHOR]
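
The abstract does not spell out how the membership-based features are built, so the following is only a generic sketch of Fuzzy C-Means, not the authors' algorithm. It assumes documents have already been turned into numeric vectors (the toy 2-D points below stand in for them), and the function name fuzzy_c_means, the fuzzifier m = 2, and the stopping tolerance are all illustrative choices. Each row of the returned membership matrix sums to 1 and could, under these assumptions, serve as a soft feature vector for a downstream multi-label classifier.

```python
import numpy as np


def fuzzy_c_means(X, n_clusters, m=2.0, n_iter=100, tol=1e-5, seed=0):
    """Generic Fuzzy C-Means (hypothetical helper, not the paper's code).

    Returns (centers, U), where U[i, k] is the degree to which
    sample i belongs to cluster k; every row of U sums to 1.
    """
    rng = np.random.default_rng(seed)
    # Random initial memberships, normalised so each row sums to 1.
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)

    for _ in range(n_iter):
        # Cluster centers: means of the samples weighted by U**m.
        W = U ** m
        centers = (W.T @ X) / W.sum(axis=0)[:, None]

        # Euclidean distance from every sample to every center.
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # guard against division by zero

        # Membership update: u_ik proportional to d_ik ** (-2 / (m - 1)).
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)

        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break

    return centers, U


# Toy data: two well-separated groups standing in for document vectors.
X = np.array(
    [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
     [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]]
)
centers, memberships = fuzzy_c_means(X, n_clusters=2)
labels = memberships.argmax(axis=1)  # hard assignment, for inspection only
print(np.round(memberships, 3))
```

Unlike a hard clustering, the membership matrix keeps a graded value for every cluster, which is the property the abstract contrasts with the probability-based output of topic modeling.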

Details

Language :
English
ISSN :
10641246
Volume :
42
Issue :
5
Database :
Complementary Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
156139425
Full Text :
https://doi.org/10.3233/JIFS-219232