Back to Search Start Over

Semi-supervised named entity recognition in multi-level contexts.

Authors :
Chen, Yubo
Wu, Chuhan
Qi, Tao
Yuan, Zhigang
Zhang, Yuesong
Yang, Shuai
Guan, Jian
Sun, Donghong
Huang, Yongfeng
Source :
Neurocomputing. Feb2023, Vol. 520, p194-204. 11p.
Publication Year :
2023

Abstract

[Display omitted] Named entity recognition is a critical task in the natural language processing field. Most existing methods for this task can only exploit contextual information within a sentence. However, their performance on recognizing entities in limited or ambiguous sentence-level contexts is usually unsatisfactory. Fortunately, other sentences in the same document can provide supplementary document-level contexts to help recognize these entities. In addition, words themselves contain word-level contextual information since they usually have different preferences of entity type and relative position from named entities. In this paper, we propose a semi-supervised unified framework to incorporate multi-level contexts for named entity recognition. We use bi-directional gated recurrent units and incorporate pre-trained language model embeddings to capture sentence-level contextual information. To incorporate document-level contexts, we propose to capture interactions between sentences via a multi-head self attention network. To mine word-level contexts, we propose an auxiliary task to predict the type of each word to capture its type preference. We jointly train our model in entity recognition and the auxiliary classification task via multi-task learning. We conduct experiments on two widely-used sequence taggers: CRF tagger and boundary tagger. The experimental results on the CoNLL dataset in English, Dutch, and German validate the effectiveness of our method. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09252312
Volume :
520
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
160939321
Full Text :
https://doi.org/10.1016/j.neucom.2022.11.064