Back to Search Start Over

The enrichment of lexical resources through incremental parsebanking.

Authors :
Rosén, Victoria
Thunes, Martha
Haugereid, Petter
Losnegaard, Gyri
Dyvik, Helge
Meurer, Paul
Lyse, Gunn
Smedt, Koenraad
Source :
Language Resources & Evaluation. Jun2016, Vol. 50 Issue 2, p291-319. 29p.
Publication Year :
2016

Abstract

Automatic syntactic analysis of a corpus requires detailed lexical and morphological information that cannot always be harvested from traditional dictionaries. Therefore the development of a treebank presents an opportunity to simultaneously enrich the lexicon. In building NorGramBank, we use an incremental parsebanking approach, in which a corpus is parsed and disambiguated, and after improvements to the grammar and the lexicon, reparsed. In this context we have implemented a text preprocessing interface where annotators can enter unknown words or missing lexical information either before parsing or during disambiguation. The information added to the lexicon in this way may be of great interest both to lexicographers and to other language technology efforts. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1574020X
Volume :
50
Issue :
2
Database :
Academic Search Index
Journal :
Language Resources & Evaluation
Publication Type :
Academic Journal
Accession number :
116036660
Full Text :
https://doi.org/10.1007/s10579-016-9356-5