Back to Search
Start Over
PoeTree: Poetry Treebanks in Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian and Spanish
- Source :
- Research Data Journal for the Humanities and Social Sciences (2024) [ISSN 2452-3666]
- Publication Year :
- 2024
-
Abstract
- This article presents a set of standardised corpora of poetry comprising over 330,000 poems in ten languages (Czech, English, French, German, Hungarian, Italian, Portuguese, Russian, Slovenian, and Spanish). Each corpus has been deduplicated, enriched with Universal Dependencies, provided with additional metadata, and converted into a unified json structure.
Details
- Database :
- OAIster
- Journal :
- Research Data Journal for the Humanities and Social Sciences (2024) [ISSN 2452-3666]
- Notes :
- DOI: 10.1163/24523666-bja10044, Research Data Journal for the Humanities and Social Sciences (2024) [ISSN 2452-3666], English
- Publication Type :
- Electronic Resource
- Accession number :
- edsoai.on1468026962
- Document Type :
- Electronic Resource