1. VivesDebate: A New Annotated Multilingual Corpus of Argumentation in a Debate Tournament
- Author
- Mariona Taulé, Ramon Ruiz-Dolz, Ana García-Fornes, Stella Heras, and Montserrat Nofre
- Subjects
- Technology; Computer science; Debate; 02 engineering and technology; Corpus; Argumentation theory; Resource (project management); Argument; 0202 electrical engineering, electronic engineering, information engineering; General Materials Science; Biology (General); Instrumentation; Fluid Flow and Transfer Processes; Oratory; Physics; General Engineering; Engineering (General). Civil engineering (General); Linguistics; Computer Science Applications; Chemistry; Argumentation (Linguistics); 020201 artificial intelligence & image processing; TA1-2040; Argumentative; Argument analysis; QH301-705.5; QC1-999; Context (language use); Annotation; Natural language processing (Computer science); Argument mining; 020204 information systems; Argumentation; Set (psychology); QD1-999; Argument evaluation; business.industry; Process Chemistry and Technology; Deep learning; Natural language processing; Corpora (Linguistics); Argument generation; Artificial intelligence; business; Computer Languages and Systems; Debates
- Abstract
- [EN] The application of the latest Natural Language Processing breakthroughs in computational argumentation has shown promising results, which have raised the interest in this area of research. However, the available corpora with argumentative annotations are often limited to a very specific purpose or are not of adequate size to take advantage of state-of-the-art deep learning techniques (e.g., deep neural networks). In this paper, we present VivesDebate, a large, richly annotated and versatile professional debate corpus for computational argumentation research. The corpus has been created from 29 transcripts of a debate tournament in Catalan and has been machine-translated into Spanish and English. The annotation contains argumentative propositions, argumentative relations, debate interactions and professional evaluations of the arguments and argumentation. The presented corpus can be useful for research on a heterogeneous set of computational argumentation underlying tasks such as Argument Mining, Argument Analysis, Argument Evaluation or Argument Generation, among others. All this makes VivesDebate a valuable resource for computational argumentation research within the context of massive corpora aimed at Natural Language Processing tasks.
- This research was funded by the Spanish Government project PID2020-113416RB-I00; the Valencian Government project PROMETEO/2018/002; the MISMIS-Language project (PGC2018096212-B-C33) funded by Ministerio de Ciencia, Innovación y Universidades; and the CLiC Research Group (2017SGR341) funded by Generalitat de Catalunya.
- Published
- 2021