Back to Search
Start Over
Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models
- Source :
- Neural Computing and Applications. Springer London
- Publication Year :
- 2021
- Publisher :
- Springer Science and Business Media LLC, 2021.
-
Abstract
- The splitting of words into stressed and unstressed syllables is the foundation for the scansion of poetry, a process that aims at determining the metrical pattern of a line of verse within a poem. Intricate language rules and their exceptions, as well as poetic licenses exerted by the authors, make calculating these patterns a nontrivial task. Some rhetorical devices shrink the metrical length, while others might extend it. This opens the door for interpretation and further complicates the creation of automated scansion algorithms useful for automatically analyzing corpora on a distant reading fashion. In this paper, we compare the automated metrical pattern identification systems available for Spanish, English, and German, against fine-tuned monolingual and multilingual language models trained on the same task. Despite being initially conceived as models suitable for semantic tasks, our results suggest that transformers-based models retain enough structural information to perform reasonably well for Spanish on a monolingual setting, and outperforms both for English and German when using a model trained on the three languages, showing evidence of the benefits of cross-lingual transfer between the languages.
- Subjects :
- Computer science
media_common.quotation_subject
Language models
02 engineering and technology
METER
computer.software_genre
Task (project management)
German
Artificial Intelligence
Reading (process)
0502 economics and business
0202 electrical engineering, electronic engineering, information engineering
Digital humanities
media_common
Interpretation (logic)
Poetry
business.industry
Natural language processing
05 social sciences
SCANSION
language.human_language
Rhetorical device
language
020201 artificial intelligence & image processing
Language model
Artificial intelligence
Scansion
business
computer
050203 business & management
Software
Subjects
Details
- ISSN :
- 14333058 and 09410643
- Database :
- OpenAIRE
- Journal :
- Neural Computing and Applications
- Accession number :
- edsair.doi.dedup.....bd5e119c1097494e6afa836daa943b27
- Full Text :
- https://doi.org/10.1007/s00521-021-06692-2