Back to Search Start Over

Unsupervised text feature selection technique based on hybrid particle swarm optimization algorithm with genetic operators for the text clustering.

Authors :
Abualigah, Laith
Khader, Ahamad
Source :
Journal of Supercomputing; Nov2017, Vol. 73 Issue 11, p4773-4795, 23p
Publication Year :
2017

Abstract

The text clustering technique is an appropriate method used to partition a huge amount of text documents into groups. The documents size affects the text clustering by decreasing its performance. Subsequently, text documents contain sparse and uninformative features, which reduce the performance of the underlying text clustering algorithm and increase the computational time. Feature selection is a fundamental unsupervised learning technique used to select a new subset of informative text features to improve the performance of the text clustering and reduce the computational time. This paper proposes a hybrid of particle swarm optimization algorithm with genetic operators for the feature selection problem. The k-means clustering is used to evaluate the effectiveness of the obtained features subsets. The experiments were conducted using eight common text datasets with variant characteristics. The results show that the proposed algorithm hybrid algorithm (H-FSPSOTC) improved the performance of the clustering algorithm by generating a new subset of more informative features. The proposed algorithm is compared with the other comparative algorithms published in the literature. Finally, the feature selection technique encourages the clustering algorithm to obtain accurate clusters. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09208542
Volume :
73
Issue :
11
Database :
Complementary Index
Journal :
Journal of Supercomputing
Publication Type :
Academic Journal
Accession number :
125744828
Full Text :
https://doi.org/10.1007/s11227-017-2046-2