Back to Search
Start Over
Unsupervised extractive multi-document text summarization using a genetic algorithm.
- Source :
- Journal of Intelligent & Fuzzy Systems; 2020, Vol. 39 Issue 2, p2397-2408, 12p
- Publication Year :
- 2020
-
Abstract
- The task of Extractive Multi-Document Text Summarization (EMDTS) aims at building a short summary with essential information from a collection of documents. In this paper, we propose an EMDTS method using a Genetic Algorithm (GA). The fitness function considering two unsupervised text features: sentence position and coverage. We propose the binary coding representation, selection, crossover, and mutation operators. We test the proposed method on the DUC01 and DUC02 data set, four different tasks (summary lengths 200 and 400 words), for each of the collections of documents (in total, 876 documents) are tested. Besides, we analyze the most frequently used methodologies to summarization. Moreover, different heuristics such as topline, baseline, baseline-random, and lead baseline are calculated. In the results, the proposed method achieves to improve the state-of-art results. [ABSTRACT FROM AUTHOR]
- Subjects :
- BINARY codes
GENETIC algorithms
TEST methods
HEURISTIC
Subjects
Details
- Language :
- English
- ISSN :
- 10641246
- Volume :
- 39
- Issue :
- 2
- Database :
- Complementary Index
- Journal :
- Journal of Intelligent & Fuzzy Systems
- Publication Type :
- Academic Journal
- Accession number :
- 145429370
- Full Text :
- https://doi.org/10.3233/JIFS-179900