Back to Search
Start Over
Feature Pyramid Attention Model and Multi-Label Focal Loss for Pedestrian Attribute Recognition
- Source :
- IEEE Access, Vol 8, Pp 164570-164579 (2020)
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- At present, there are many challenges in the field of pedestrian attribute recognition, such as small targets of some attributes, imbalanced samples, and low recognition accuracy of complex samples. In view of the above problems, we improved the model in two perspectives: 1) We proposed Feature Pyramid Attention Model (FPAM). In order to solve the problem that attributes are distributed in different locations in the pedestrian image, FPAM adopted the attention mechanism on the basis of ResNet-50, by which the model’s attention could be focused on key areas of the image. As for the difficulty in small targets attributes, we adopted feature pyramid integration strategy; 2) We proposed Multi Label Focal Loss (MLFL). Referring to Binary Cross Entropy Loss Function (CE) and Weight Binary Cross Entropy Loss Function (WCE), we added the weight parameters of samples which are difficult to classify to improve the recognition accuracy, and the rate of convergence was increased. Results show that our proposed method achieves 84.83% mA, 79.37% Accuracy, 87.47% Precision, 86.09% Recall, and 86.77% F1 on PETA dataset.
Details
- Language :
- English
- ISSN :
- 21693536
- Volume :
- 8
- Database :
- Directory of Open Access Journals
- Journal :
- IEEE Access
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.bc355c114c0541b189cb1fcc0549059b
- Document Type :
- article
- Full Text :
- https://doi.org/10.1109/ACCESS.2020.3010435