Back to Search Start Over

Unsupervised extractive multi-document text summarization using a genetic algorithm.

Authors :
Neri-Mendoza, Verónica
Ledeneva, Yulia
García-Hernández, René Arnulfo
Pinto, David
Singh, Vivek
Perez, Fernando
Source :
Journal of Intelligent & Fuzzy Systems; 2020, Vol. 39 Issue 2, p2397-2408, 12p
Publication Year :
2020

Abstract

The task of Extractive Multi-Document Text Summarization (EMDTS) aims at building a short summary with essential information from a collection of documents. In this paper, we propose an EMDTS method using a Genetic Algorithm (GA). The fitness function considering two unsupervised text features: sentence position and coverage. We propose the binary coding representation, selection, crossover, and mutation operators. We test the proposed method on the DUC01 and DUC02 data set, four different tasks (summary lengths 200 and 400 words), for each of the collections of documents (in total, 876 documents) are tested. Besides, we analyze the most frequently used methodologies to summarization. Moreover, different heuristics such as topline, baseline, baseline-random, and lead baseline are calculated. In the results, the proposed method achieves to improve the state-of-art results. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10641246
Volume :
39
Issue :
2
Database :
Complementary Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
145429370
Full Text :
https://doi.org/10.3233/JIFS-179900