Back to Search Start Over

Research Progress of Multimodal Named Entity Recognition.

Authors :
WANG Hairong
XU Xi
WANG Tong
JING Boxiang
Source :
Journal of Zhengzhou University: Engineering Science; Mar2024, Vol. 45 Issue 2, p60-71, 12p
Publication Year :
2024

Abstract

In order to solve the problems in studies of multimodal named entity recognition, such as the lack of text feature semantics, the lack of visual feature semantics, and the difficulty of graphic feature fusion, a series of multimodal named entity recognition methods were proposed. Firstly, the overall framework of multi modal named entity recognition methods and common technologies in each part were examined, and classified into BilSTM-based MNER method and Transformer based MNER method. Furthermore, according to the model structure, it was further divided into four model structures, including pre-fusion model, post-fusion model, Transformer single-task model and Transformer multi-task model. Then, experiments were carried out on two data sets of Twitter-2015 and Twitter-2017 for these two types of methods respectively. The experimental results showed that multi-feature cooperative representation could enhance the semantics of each modal feature. In addition, multi-task learning could promote modal feature fusion or result fusion, so as to improve the accuracy of MNER. Finally, in the future research of MNER, it was suggested to focus on enhancing modal semantics through multi-feature cooperative representation, and promoting model feature fusion or result fusion by multi-task learning. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
16716833
Volume :
45
Issue :
2
Database :
Complementary Index
Journal :
Journal of Zhengzhou University: Engineering Science
Publication Type :
Academic Journal
Accession number :
176110614
Full Text :
https://doi.org/10.13705/j.issn.1671-6833.2024.02.001