Back to Search
Start Over
Deep feature network with multi-scale fusion for highly congested crowd counting.
- Source :
- International Journal of Machine Learning & Cybernetics; Mar2024, Vol. 15 Issue 3, p819-835, 17p
- Publication Year :
- 2024
-
Abstract
- In this paper, we propose a deep feature network with multi-scale fusion (DFNet) for addressing the problem of crowd counting in highly congested noisy scenes. DFNet contains three modules: feature encoder, feature decoder and feature fusion. The feature encoder uses a VGG-16-based convolutional neural network (CNN) that encodes features from images and forms a kind of low-level spatial information. The feature decoder is a multi-column dilated convolutional neural network (McDCNN) with different dilation rates that can capture a multi-scale contextual information, decode the low-level spatial information and generate a kind of high-level semantic information. Furthermore, the multi-column architecture in McDCNN can effectively relieve the "gridding" issue presented in the dilated convolution framework. The feature fusion block uses a simple and effective network architecture to sufficiently incorporate the low-level spatial and the high-level semantic information for facilitating high-quality density map estimation and performing accurate crowd counting. Extensive experiments on several highly challenging crowd counting datasets are conducted. Experimental results show that DFNet is comparable with recent state-of-the-art approaches. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 18688071
- Volume :
- 15
- Issue :
- 3
- Database :
- Complementary Index
- Journal :
- International Journal of Machine Learning & Cybernetics
- Publication Type :
- Academic Journal
- Accession number :
- 175360837
- Full Text :
- https://doi.org/10.1007/s13042-023-01941-3