Back to Search Start Over

On German verb sense disambiguation: A three-part approach based on linking a sense inventory (GermaNet) to a corpus through annotation (TGVCorp) and using the corpus to train a VSD classifier (TTvSense).

Authors :
Mattern, Dominik
Hemati, Wahed
Lücking, Andy
Mehler, Alexander
Source :
Journal of Language Modelling; Jun2024, Vol. 12 Issue 1, p155-212, 58p
Publication Year :
2024

Abstract

We develop a three-part approach to Verb Sense Disambiguation (VSD) in German. After considering a set of lexical resources and corpora, we arrive at a statistically motivated selection of a subset of verbs and their senses from GermaNet. This sub-inventory is then used to disambiguate the occurrences of the corresponding verbs in a corpus resulting from the union of TüBa-D/Z, Salsa, and E-VALBU. The corpus annotated in this way is called TGVCorp. It is used in the third part of the paper for training a classifier for VSD and for its comparative evaluation with a state-of-the-art approach in this research area, namely EWISER. Our simple classifier outperforms the transformer-based approach on the same data in both accuracy and speed in German but not in English and we discuss possible reasons. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
2299856X
Volume :
12
Issue :
1
Database :
Complementary Index
Journal :
Journal of Language Modelling
Publication Type :
Academic Journal
Accession number :
179914305
Full Text :
https://doi.org/10.15398/jlm.v12i1.356