Back to Search Start Over

Application of Improved Transformer Based onWeakly Supervised in Crowd Localization.

Authors :
GAO Hui
DENG Miaolei
ZHAO Wenjun
CHEN Faquan
ZHANG Dexian
Source :
Journal of Computer Engineering & Applications; Oct2023, Vol. 59 Issue 19, p92-98, 7p
Publication Year :
2023

Abstract

Aiming to address the issue of complex preprocessing and post-processing required by existing crowd localization methods that employ pseudo bounding boxes or pre-designed localization maps, an end-to-end crowd localization network based on weakly supervised, LocalFormer, is proposed. In the feature extraction stage, a pure Transformer is used as the backbone network, and a global max-pooling operation is performed on the features of each stage to extract more comprehensive details of human heads. In the encoder-decoder stage, the positional information is embedded into the aggregated features as input to the encoder. Each decoder layer uses a set of trainable embeddings as queries, and takes visual features of the last layer of the encoder as keys and values. The decoded features are then used to predict confidence scores. Finally, a binary module is introduced with an adaptive optimized threshold learner to precisely binarize the confidence maps. Extensive experiments on three datasets in different environments show that the proposed method achieves the best positioning performance. [ABSTRACT FROM AUTHOR]

Details

Language :
Chinese
ISSN :
10028331
Volume :
59
Issue :
19
Database :
Complementary Index
Journal :
Journal of Computer Engineering & Applications
Publication Type :
Academic Journal
Accession number :
172996953
Full Text :
https://doi.org/10.3778/j.issn.1002-8331.2206-0199