Back to Search Start Over

A topic modeling based approach to novel document automatic summarization.

Authors :
Wu, Zongda
Lei, Li
Li, Guiling
Huang, Hui
Zheng, Chengren
Chen, Enhong
Xu, Guandong
Source :
Expert Systems with Applications. Oct2017, Vol. 84, p12-23. 12p.
Publication Year :
2017

Abstract

Most of existing text automatic summarization algorithms are targeted for multi-documents of relatively short length, thus difficult to be applied immediately to novel documents of structure freedom and long length. In this paper, aiming at novel documents, we propose a topic modeling based approach to extractive automatic summarization, so as to achieve a good balance among compression ratio, summarization quality and machine readability. First, based on topic modeling, we extract the candidate sentences associated with topic words from a preprocessed novel document. Second, with the goals of compression ratio and topic diversity, we design an importance evaluation function to select the most important sentences from the candidate sentences and thus generate an initial novel summary. Finally, we smooth the initial summary to overcome the semantic confusion caused by ambiguous or synonymous words, so as to improve the summary readability. We evaluate experimentally our proposed approach on a real novel dataset. The experiment results show that compared to those from other candidate algorithms, each automatic summary generated by our approach has not only a higher compression ratio, but also better summarization quality. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
84
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
123310151
Full Text :
https://doi.org/10.1016/j.eswa.2017.04.054