1. LARFNet: Lightweight asymmetric refining fusion network for real-time semantic segmentation.
- Author
-
Hu, Xuegang and Gong, Juelin
- Subjects
- *
PROBLEM solving , *MULTICASTING (Computer networks) , *PYRAMIDS , *PIXELS - Abstract
In this paper, we propose a lightweight asymmetric refining fusion network (LARFNet) for real-time semantic segmentation to solve the problem that some existing models cannot achieve good segmentation accuracy with real-time inference speed in mobile devices due to the huge computational overhead. Specifically, LARFNet adopts an asymmetric encoder–decoder structure. The depth-wise separable asymmetric interaction module (DSAI module) is designed in the encoding process, which effectively extracted local and surrounding information under different receptive fields with optimized convolution in the condition of ensuring communication between channels. In the decoder, we design the bilateral pyramid pooling attention module (BPPA module) and the multi-stage refinement fusion module (MRF Module). The BPPA module is used to integrate the high-level output multi-scale context information. Based on spatial and channel attention mechanisms, the MRF module is proposed to refine the feature maps of different resolutions and guide the feature fusion. Experimental results show that LARFNet achieves 69.2% mIoU and 65.6% mIoU on Cityscapes and Camvid datasets at 127 FPS and 222 FPS respectively, only using a single NVIDIA GeForce GTX2080Ti GPU and 0.72M parameters without any pre-training or pre-processing. Compared with most of the existing state-of-the-art models, the proposed method realizes the efficient use of network parameters at a faster speed, reduces the number of network parameters, and still achieves the accuracy of good segmentation. [Display omitted] • In this paper, we propose a lightweight real-time semantic segmentation network which considers inference speed, number of model parameters and segmentation accuracy as a whole, named lightweight asymmetric refining fusion network (LARFNet). The network mainly consists of three modules. The DSAI module is used to extract different features. The BPPA module is used to provide pixel-level attention for features, and the MRF module is used to guide feature fusion after optimizing features. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF