Back to Search Start Over

A Novel Adaptive Attention Model for Image Captioning

Authors :
Ming Ding
Donglin Liang
Jinzhao Wu
Anping He
Source :
Journal of Physics: Conference Series. 1549:032131
Publication Year :
2020
Publisher :
IOP Publishing, 2020.

Abstract

Image captioning is an extremely challenging work, which involves both computer vision and natural language processing. However, in recent years, with the introduction and improvement of various new technologies, it has made a great advancement. In this paper, a novel attention-based image caption model is proposed. We use Resnet harnessed by outstanding performance in convolutional neural network, as the encoder. For the decoding, LSTM network equipping with fitness of dealing with problems highly related to time series, is adopted. At each time step, the attention mechanism guides the language model to synthesize context vectors adaptively. The experimental results show that the model has good performance in various quantitative metrics as well as in human evaluation.

Details

ISSN :
17426596 and 17426588
Volume :
1549
Database :
OpenAIRE
Journal :
Journal of Physics: Conference Series
Accession number :
edsair.doi...........54e30c9cbdd925ab6c60b2eb5c28dda6
Full Text :
https://doi.org/10.1088/1742-6596/1549/3/032131