1. Deformable channel non‐local network for crowd counting
- Author
-
Ting Zhang, Huake Wang, Kaibing Zhang, and Xingsong Hou
- Subjects
image processing ,image recognition ,image and vision processing and display technology ,learning systems ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Abstract Both global dependency and local correlation are crucial for solving the scale variation of crowd. However, most of previous methods fail to take two factors into consideration simultaneously. Against the aforementioned issue, a deformable channel non‐local network, abbreviated as DCNLNet for crowd counting, which can simultaneously learn global context information and adaptive local receptive field is proposed. Specifically, the proposed DCNLNet consists of two well‐crafted designed modules: deformable channel non‐local block (DCNL) and spatial attention feature fusion block (SAFF). The DCNL encodes long‐range dependencies between pixels and the adaptive local correlation with channel non‐local and deformable convolution, respectively, benefiting for improving the spatial discrimination of features. While the SAFF aims to aggregate the cross‐level information, which interacts these features from different depths and learns specific weights for the feature maps with spatial attention. Extensive experiments are performed on three crowd counting benchmark datasets and experimental results indicate that the proposed DCNLNet achieves compelling performance compared to other representative counting models.
- Published
- 2023
- Full Text
- View/download PDF