Back to Search Start Over

Efficient Task-Specific Feature Re-Fusion for More Accurate Object Detection and Instance Segmentation

Authors :
Wang, Cheng
Fang, Yuxin
Fang, Jiemin
Guo, Peng
Wu, Rui
Huang, He
Wang, Xinggang
Huang, Chang
Liu, Wenyu
Source :
IEEE Transactions on Circuits and Systems for Video Technology; 2024, Vol. 34 Issue: 7 p5350-5360, 11p
Publication Year :
2024

Abstract

Feature pyramid representations have been widely adopted in the object detection literature for better handling of variations in scale, which provide abundant information from various spatial levels for classification and localization sub-tasks. We find that inter sub-task feature disentanglement and intra sub-task feature re-fusion are crucial for final prediction performance, but are hard to be achieved simultaneously considering the computational efficiency. We find this issue can be addressed by delicate module design. In this paper, we propose an Efficient Task-specific Feature Re-fusion (ETFR) module to mitigate the dilemma. ETFR disentangles inter sub-task features, reduces the output channels of multi-scale features based on their importance and re-fuses intra sub-task features via concatenation operation. As a plug-and-play module, ETFR can remarkably and consistently improve the well-established and highly-optimized object detection and instance segmentation methods, such as RetinaNet, FCOS, BlendMask and CondInst, with neglectable extra computation cost. Extensive experiments demonstrate that ETFR has good generalization ability on various changeling datasets, including COCO, LVIS and Cityscapes.

Details

Language :
English
ISSN :
10518215 and 15582205
Volume :
34
Issue :
7
Database :
Supplemental Index
Journal :
IEEE Transactions on Circuits and Systems for Video Technology
Publication Type :
Periodical
Accession number :
ejs66895162
Full Text :
https://doi.org/10.1109/TCSVT.2023.3344713