Back to Search
Start Over
On German verb sense disambiguation: A three-part approach based on linking a sense inventory (GermaNet) to a corpus through annotation (TGVCorp) and using the corpus to train a VSD classifier (TTvSense).
- Source :
- Journal of Language Modelling; Jun2024, Vol. 12 Issue 1, p155-212, 58p
- Publication Year :
- 2024
-
Abstract
- We develop a three-part approach to Verb Sense Disambiguation (VSD) in German. After considering a set of lexical resources and corpora, we arrive at a statistically motivated selection of a subset of verbs and their senses from GermaNet. This sub-inventory is then used to disambiguate the occurrences of the corresponding verbs in a corpus resulting from the union of TüBa-D/Z, Salsa, and E-VALBU. The corpus annotated in this way is called TGVCorp. It is used in the third part of the paper for training a classifier for VSD and for its comparative evaluation with a state-of-the-art approach in this research area, namely EWISER. Our simple classifier outperforms the transformer-based approach on the same data in both accuracy and speed in German but not in English and we discuss possible reasons. [ABSTRACT FROM AUTHOR]
- Subjects :
- GERMAN language
SUBSET selection
VERBS
CORPORA
SENSES
Subjects
Details
- Language :
- English
- ISSN :
- 2299856X
- Volume :
- 12
- Issue :
- 1
- Database :
- Complementary Index
- Journal :
- Journal of Language Modelling
- Publication Type :
- Academic Journal
- Accession number :
- 179914305
- Full Text :
- https://doi.org/10.15398/jlm.v12i1.356