Combine Topic Modeling with Semantic Embedding: Embedding Enhanced Topic Model.

Authors :
Zhang, Peng
Wang, Suge
Li, Deyu
Li, Xiaoli
Xu, Zhikang
Source :
IEEE Transactions on Knowledge & Data Engineering. Dec 2020, Vol. 32 Issue 12, p2322-2335. 14p.
Publication Year :
2020

Abstract

Topic model and word embedding reflect two perspectives of text semantics. Topic model maps documents into topic distribution space by utilizing word collocation patterns within and across documents, while word embedding represents words within a continuous embedding space by exploiting the local word collocation patterns in context windows. Clearly, these two types of patterns are complementary. In this paper, we propose a novel integration framework to combine the two representation methods, where topic information can be transmitted into corresponding semantic embedding structure. Based on this framework, we construct an Embedding Enhanced Topic Model (EETM), which can improve topic modeling and generate topic embeddings by leveraging the word embedding. Extensive experimental results show that EETM can learn high-quality document representations for common text analysis tasks across multiple data sets, indicating it is very effective for merging topic models with word embeddings. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10414347
Volume :
32
Issue :
12
Database :
Academic Search Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
147023048
Full Text :
https://doi.org/10.1109/TKDE.2019.2922179