Back to Search Start Over

Efficient Selective Context Network for Accurate Object Detection.

Authors :
Nie, Jing
Pang, Yanwei
Zhao, Shengjie
Han, Jungong
Li, Xuelong
Source :
IEEE Transactions on Circuits & Systems for Video Technology. Sep2021, Vol. 31 Issue 9, p3456-3468. 13p.
Publication Year :
2021

Abstract

Single-stage detectors have gained great attention due to their high detection accuracy and real-time speed. To detect multi-scale objects, single-stage detectors make scale-aware predictions based on multiple pyramid layers. However, the insufficient context exploration in shallow pyramid layers leads to the detection accuracy of small objects being far from satisfactory. To tackle this problem, we propose a scheme to selectively extract multi-scale context with attention-adaptive weights. Specifically, we propose an efficient selective context network for accurate object detection. It incorporates an enhanced context module and a triple attention module. The enhanced context module consists of multi-branches to extract original-scale, small-scale, and large-scale contextual information. To make full use of this context and filter out noisy information, the triple attention module, which contains global-level, channel-level, and spatial-level attentions, is introduced to carry out selective context fusion. The two modules are easy to implement and can efficiently boost the accuracy of object detection. The performance of our method is validated on two benchmarks: PASCAL VOC and MS COCO. For a $512\times 512$ input, our detector with VGG16 achieves competitive results (80.9 on the Pascal VOC 2012 test set in the case of single-scale inference without MS COCO pre-training). On the MS COCO test-dev set, our detector with ResNet101 outperforms RetinaNet500 by 2.5% AP in terms of overall performance and its speed is 48 milliseconds on a Titan XP GPU. As a result, ESCNet achieves a better trade-off between accuracy and speed. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10518215
Volume :
31
Issue :
9
Database :
Academic Search Index
Journal :
IEEE Transactions on Circuits & Systems for Video Technology
Publication Type :
Academic Journal
Accession number :
153376817
Full Text :
https://doi.org/10.1109/TCSVT.2020.3038649