Back to Search Start Over

PoeTree: Poetry Treebanks in Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian and Spanish

Authors :
Plecháč, Petr
Cinková, Silvie
Kolár, Robert
Šeļa, Artjoms
De Sisto, Mirella
Nugues, Lara
Haider, Thomas
Kočnik, Neža
Plecháč, Petr
Cinková, Silvie
Kolár, Robert
Šeļa, Artjoms
De Sisto, Mirella
Nugues, Lara
Haider, Thomas
Kočnik, Neža
Source :
Research Data Journal for the Humanities and Social Sciences (2024) [ISSN 2452-3666]
Publication Year :
2024

Abstract

This article presents a set of standardised corpora of poetry comprising over 330,000 poems in ten languages (Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian, and Spanish). Each corpus has been deduplicated, enriched with Universal Dependencies, provided with additional metadata, and converted into a unified json structure.

Details

Database :
OAIster
Journal :
Research Data Journal for the Humanities and Social Sciences (2024) [ISSN 2452-3666]
Notes :
DOI: 10.1163/24523666-bja10044, Research Data Journal for the Humanities and Social Sciences (2024) [ISSN 2452-3666], English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1468026962
Document Type :
Electronic Resource