Annotador: a temporal tagger for Spanish.

Authors :
Navas-Loro, María
Rodríguez-Doncel, Víctor
Pinto, David
Singh, Vivek
Perez, Fernando
Source :
Journal of Intelligent & Fuzzy Systems. 2020, Vol. 39 Issue 2, p1979-1991. 13p.
Publication Year :
2020

Abstract

Temporal information is crucial in knowledge extraction. Being able to locate events on a timeline is necessary to understand the narrative behind every text. To this end, several temporal taggers have been proposed in the literature; nevertheless, not all languages have received the same attention. Most taggers work only for English texts, and few have been developed for other languages. The scarcity of annotated corpora in other languages also notably hinders the task. In this paper we present a new rule-based tagger called Annotador (Añotador in Spanish), able to process texts in both Spanish and English. Furthermore, a new corpus of more than 300 short texts containing common temporal expressions, called the HourGlass corpus, has been built to test the tagger and to facilitate the development of new resources and tools. Professionals from different domains took part in gathering the texts, which makes the corpus heterogeneous, and the tags added to each entry make it easy to use. Finally, we analyze the main challenges in the time expression extraction task. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
*CORPORA
*SPANISH language

Details

Language :
English
ISSN :
10641246
Volume :
39
Issue :
2
Database :
Academic Search Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
145429335
Full Text :
https://doi.org/10.3233/JIFS-179865