Back to Search Start Over

Truncated attention mechanism and cascade loss for cross-modal person re-identification.

Authors :
Shi, Shuo
Huo, Changwei
Guo, Yingchun
Lean, Stephen
Yan, Gang
Yu, Ming
Source :
Journal of Intelligent & Fuzzy Systems. 2021, Vol. 41 Issue 6, p6575-6587. 13p.
Publication Year :
2021

Abstract

Person re-identification with natural language description is a process of retrieving the corresponding person's image from an image dataset according to a text description of the person. The key challenge in this cross-modal task is to extract visual and text features and construct loss functions to achieve cross-modal matching between text and image. Firstly, we designed a two-branch network framework for person re-identification with natural language description. In this framework we include the following: a Bi-directional Long Short-Term Memory (Bi-LSTM) network is used to extract text features and a truncated attention mechanism is proposed to select the principal component of the text features; a MobileNet is used to extract image features. Secondly, we proposed a Cascade Loss Function (CLF), which includes cross-modal matching loss and single modal classification loss, both with relative entropy function, to fully exploit the identity-level information. The experimental results on the CUHK-PEDES dataset demonstrate that our method achieves better results in Top-5 and Top-10 than other current 10 state-of-the-art algorithms. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10641246
Volume :
41
Issue :
6
Database :
Academic Search Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
154454890
Full Text :
https://doi.org/10.3233/JIFS-210382