Back to Search Start Over

Multi-modal Human pose estimation based on probability distribution perception on a depth convolution neural network.

Authors :
Wang, Xunjun
Hu, Xiaochun
Li, Yun
Jiang, Caoqing
Source :
Pattern Recognition Letters. Jan2022, Vol. 153, p36-43. 8p.
Publication Year :
2022

Abstract

• A method of human key-point coordinate representation based on probability distribution perception. • Addressing issues :key-point representation quantization error, algorithm inconsistency. • This method gives a high level of precision in the experiment. Existing human pose estimation methods have some problems, such as key-point representation quantization error, algorithm inconsistency, and easy-to-introduce unequal quantization error. Addressing these issues, this paper proposes a method of human key-point coordinate representation based on probability distribution perception on a depth convolution neural network. In this method, the perceived probability of the key-point coordinate distribution is extracted by a convolutional neural network, and the cross entropy is formed with the prediction probability distribution as the loss function. Under the constraint of minimizing the Kullback-Leibler (KL) distance between the prediction and the ground truth, the convolution neural network is optimized, and the coordinates of the heat map are finally located. The experimental results show that compared with the traditional method based on a heat map, the proposed method can accelerate the reasoning speed of the model in the COCO training set, and at the same time, it can improve the map by 1.7 [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01678655
Volume :
153
Database :
Academic Search Index
Journal :
Pattern Recognition Letters
Publication Type :
Academic Journal
Accession number :
154692379
Full Text :
https://doi.org/10.1016/j.patrec.2021.11.021