
A Text Clustering Approach of Chinese News Based on Neural Network Language Model.

Authors :
Fan, Zhaoxin
Chen, Shuoying
Zha, Li
Yang, Jiadong
Source :
International Journal of Parallel Programming; Feb2016, Vol. 44 Issue 1, p198-206, 9p
Publication Year :
2016

Abstract

Text clustering plays an important role in data mining and machine learning. After years of development, clustering technology has produced a series of theories and methods. However, in the text clustering of Chinese news, the mainstream LDA method suffers from high time complexity. To improve the speed, this paper puts forward a new method in which a neural network language model is first applied to text clustering. Text clustering is first converted to its dual problem, word clustering. With a neural network language model, we obtain word vectors that can be used in fuzzy k-means clustering of the Chinese news keyword set. Based on the keyword clustering result, we obtain the text clustering result for Chinese news by a single transition. Experiments have shown that this method runs five times faster than LDA. The method is currently used successfully in the Sohu news recommendation system. [ABSTRACT FROM AUTHOR]
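
The pipeline the abstract describes (word vectors, then fuzzy k-means over keywords, then a transition from keyword clusters to text clusters) can be illustrated with a minimal Python sketch. This record gives no implementation details, so everything below is an assumption made for illustration: the two-dimensional word vectors are invented rather than produced by a neural network language model, the fuzziness exponent `m = 2` and the initial centers are arbitrary, and the "single transition" is interpreted here as assigning each document to the cluster with the largest summed keyword membership. It is not the authors' implementation.

```python
import math

def fuzzy_kmeans(vectors, centers, m=2.0, iters=50):
    """Fuzzy k-means (fuzzy c-means) over equal-length vectors.

    Returns (memberships, centers); memberships[i][j] is the degree to
    which vector i belongs to cluster j, and each row sums to 1.
    """
    centers = [list(c) for c in centers]
    k, dim, n = len(centers), len(vectors[0]), len(vectors)
    exponent = 2.0 / (m - 1.0)
    memberships = []
    for _ in range(iters):
        # Membership update: closer centers receive higher membership.
        memberships = []
        for v in vectors:
            # Clamp distances so a vector sitting on a center does not divide by zero.
            d = [max(math.dist(v, c), 1e-12) for c in centers]
            memberships.append(
                [1.0 / sum((d[j] / d[l]) ** exponent for l in range(k)) for j in range(k)]
            )
        # Center update: mean of all vectors weighted by membership ** m.
        for j in range(k):
            w = [memberships[i][j] ** m for i in range(n)]
            total = sum(w)
            centers[j] = [
                sum(w[i] * vectors[i][t] for i in range(n)) / total for t in range(dim)
            ]
    return memberships, centers

def cluster_documents(doc_keywords, words, memberships):
    """Assumed transition step: label each document with the cluster whose
    summed keyword membership is largest. Unknown keywords are ignored."""
    index = {w: i for i, w in enumerate(words)}
    k = len(memberships[0])
    labels = []
    for keywords in doc_keywords:
        scores = [0.0] * k
        for w in keywords:
            if w in index:
                for j in range(k):
                    scores[j] += memberships[index[w]][j]
        labels.append(max(range(k), key=scores.__getitem__))
    return labels

# Hypothetical word vectors; a real system would take them from a trained
# neural network language model.
word_vecs = {
    "football": [0.90, 0.10], "goal": [0.80, 0.20], "league": [0.85, 0.15],
    "stock": [0.10, 0.90], "market": [0.20, 0.80], "bank": [0.15, 0.85],
}
words = list(word_vecs)
vectors = [word_vecs[w] for w in words]

# Deterministic initial centers, chosen only to keep the example reproducible.
init_centers = [word_vecs["football"], word_vecs["stock"]]
memberships, centers = fuzzy_kmeans(vectors, init_centers)

docs = [["football", "goal"], ["stock", "market"], ["league", "bank", "goal"]]
labels = cluster_documents(docs, words, memberships)
print(labels)
```

Because the keyword vocabulary is typically much smaller than the document collection, clustering words once and then mapping documents through their keywords keeps the per-document cost to a lookup and a sum, which is consistent with the speed motivation stated in the abstract.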

Details

Language :
English
ISSN :
0885-7458
Volume :
44
Issue :
1
Database :
Complementary Index
Journal :
International Journal of Parallel Programming
Publication Type :
Academic Journal
Accession number :
112194000
Full Text :
https://doi.org/10.1007/s10766-014-0329-2