Attentive encoder-decoder networks for crowd counting.

Authors :
Liu, Xuhui
Hu, Yutao
Zhang, Baochang
Zhen, Xiantong
Luo, Xiaoyan
Cao, Xianbin
Source :
Neurocomputing. Jun2022, Vol. 490, p246-257. 12p.
Publication Year :
2022

Abstract

Crowd counting, which aims to estimate crowd density, has recently made significant progress but remains an unsolved problem due to several challenges. In this paper, we propose an Attentive Encoder-Decoder Network (AEDNet) to overcome the notorious scale-variation problem in crowd counting. Our major contributions can be summarized in three aspects. First, we design an Attentive Feature Refinement (AFR) block in the encoder to adaptively extract multi-scale features. AFR compares the spatial information at different scales through the attention mechanism and then adaptively assigns importance weights to each point, which highlights the distinctive roles in multi-scale feature extraction. Second, we develop a Separable Non-local Fusion (SNF) block in the decoder with the self-attention mechanism to aggregate multi-scale features from different layers, which not only achieves sufficient feature fusion by capturing long-range dependencies but also vastly reduces the computation cost compared to the original non-local operation. Third, we propose a Regional MSE (R-MSE) loss to tackle the pixel-isolation problem in the regular MSE loss. To demonstrate the effectiveness of the proposed AEDNet, we conduct extensive experiments on four widely used crowd counting datasets, and AEDNet consistently achieves state-of-the-art performance. [ABSTRACT FROM AUTHOR]
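
The abstract names the R-MSE loss but does not define it. As an illustration only, the sketch below shows one plausible reading of the "pixel-isolation" issue: per-pixel MSE penalizes a prediction that is off by a single pixel, while a loss computed on region counts does not. The function names (`pixel_mse`, `region_sums`, `regional_mse`), the non-overlapping k x k regions, and the toy density maps are all assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def pixel_mse(pred, target):
    """Ordinary per-pixel MSE between two 2-D density maps."""
    return float(np.mean((pred - target) ** 2))


def region_sums(density, k):
    """Sum a 2-D density map over non-overlapping k x k regions."""
    h, w = density.shape
    if h % k or w % k:
        raise ValueError("map height and width must be divisible by k")
    # Split each axis into (blocks, k) and sum within every block.
    return density.reshape(h // k, k, w // k, k).sum(axis=(1, 3))


def regional_mse(pred, target, k=2):
    """Hypothetical regional MSE: compare k x k region counts, not pixels.

    This is an assumed formulation for illustration, not the paper's R-MSE.
    """
    diff = region_sums(pred, k) - region_sums(target, k)
    return float(np.mean(diff ** 2))


# Toy maps: the prediction places one person one pixel away from the truth.
target = np.zeros((4, 4))
target[0, 0] = 1.0
pred = np.zeros((4, 4))
pred[0, 1] = 1.0

print(pixel_mse(pred, target))        # 0.125
print(regional_mse(pred, target, 2))  # 0.0
```

In this toy case both maps contain the same count in the top-left 2 x 2 region, so the region-level loss is zero while the per-pixel loss is not. Whether the published R-MSE uses fixed blocks, overlapping windows, or some other regional statistic cannot be determined from this record.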

Details

Language :
English
ISSN :
09252312
Volume :
490
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
156362331
Full Text :
https://doi.org/10.1016/j.neucom.2021.11.087