Back to Search Start Over

Topic representation using semantic-based patterns

Authors :
Le, Thuc D.
Liu, Lin
Ong, Kok-Leong
Zhao, Yanchang
Jin, Warren H.
Wong, Sebastien
Williams, Graham
Kapugama Geeganage, Dakshi
Xu, Yue
Li, Yuefeng
Le, Thuc D.
Liu, Lin
Ong, Kok-Leong
Zhao, Yanchang
Jin, Warren H.
Wong, Sebastien
Williams, Graham
Kapugama Geeganage, Dakshi
Xu, Yue
Li, Yuefeng
Source :
Data Mining: 17th Australasian Conference, AusDM 2019, Proceedings (Communications in Computer and Information Science book series, Volume 1127)
Publication Year :
2019

Abstract

Topic modelling is the state of the art technique for understanding, organizing, and extracting information from text collections. Traditional topic modeling approaches apply probabilistic techniques to generate the list of topics from collections. Nevertheless, human understands, summarizes and discovers the topics based on the meaning of the content. Hence, the quality of the topic models can be improved by grasping the meaning from the content. In this paper, we propose an approach to identify sets of meaningful terms based on ontology, called Semantic-based Patterns, which represent the content of a collection of documents. A set of related semantic-based patterns can be used to represent a latent topic in the collection. The proposed Topic Representation using Semantic-based Patterns aims to generate semantically meaningful patterns based on ontology rather than term co-occurrence as what existing topic modelling methods do. The semantically meaningful patterns were evaluated by applying the information filtering to semantic-based topic representation. The semantic based patterns were used as features for information filtering and were evaluated by comparing against popular information filtering baseline systems. Topic quality was evaluated in terms of topic coherence and perplexity. The experimental results verified that the quality of the proposed patterns was better than features used in baseline systems for information filtering. Further, the quality of topic representation outperforms the generated topics of other topic modeling approaches.

Details

Database :
OAIster
Journal :
Data Mining: 17th Australasian Conference, AusDM 2019, Proceedings (Communications in Computer and Information Science book series, Volume 1127)
Publication Type :
Electronic Resource
Accession number :
edsoai.on1146610566
Document Type :
Electronic Resource