Back to Search Start Over

Feature Pyramid Attention Model and Multi-Label Focal Loss for Pedestrian Attribute Recognition

Authors :
Ye Li
Fangyan Shi
Shaoqi Hou
Jipeng Li
Chao Li
Guangqiang Yin
Source :
IEEE Access, Vol 8, Pp 164570-164579 (2020)
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

At present, there are many challenges in the field of pedestrian attribute recognition, such as small targets of some attributes, imbalanced samples, and low recognition accuracy of complex samples. In view of the above problems, we improved the model in two perspectives: 1) We proposed Feature Pyramid Attention Model (FPAM). In order to solve the problem that attributes are distributed in different locations in the pedestrian image, FPAM adopted the attention mechanism on the basis of ResNet-50, by which the model’s attention could be focused on key areas of the image. As for the difficulty in small targets attributes, we adopted feature pyramid integration strategy; 2) We proposed Multi Label Focal Loss (MLFL). Referring to Binary Cross Entropy Loss Function (CE) and Weight Binary Cross Entropy Loss Function (WCE), we added the weight parameters of samples which are difficult to classify to improve the recognition accuracy, and the rate of convergence was increased. Results show that our proposed method achieves 84.83% mA, 79.37% Accuracy, 87.47% Precision, 86.09% Recall, and 86.77% F1 on PETA dataset.

Details

Language :
English
ISSN :
21693536
Volume :
8
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.bc355c114c0541b189cb1fcc0549059b
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2020.3010435