Back to Search Start Over

Multi-scale object detection by top-down and bottom-up feature pyramid network.

Authors :
Baojun ZHAO
Boya ZHAO
Linbo TANG
Wenzheng WANG
Chen WU
Source :
Journal of Systems Engineering & Electronics. Feb2019, Vol. 30 Issue 1, p1-12. 12p.
Publication Year :
2019

Abstract

While moving ahead with the object detection technology, especially deep neural networks, many related tasks, such as medical application and industrial automation, have achieved great success. However, the detection of objects with multiple aspect ratios and scales is still a key problem. This paper proposes a top-down and bottom-up feature pyramid network (TDBU-FPN), which combines multi-scale feature representation and anchor generation at multiple aspect ratios. First, in order to build the multi-scale feature map, this paper puts a number of fully convolutional layers after the backbone. Second, to link neighboring feature maps, top-down and bottom-up flows are adopted to introduce context information via top-down flow and supplement suboriginal information via bottom-up flow. The top-down flow refers to the deconvolution procedure, and the bottom-up flow refers to the pooling procedure. Third, the problem of adapting different object aspect ratios is tackled via many anchor shapes with different aspect ratios on each multi-scale feature map. The proposed method is evaluated on the pattern analysis, statistical modeling and computational learning visual object classes (PASCAL VOC) dataset and reaches an accuracy of 79%, which exhibits a 1.8% improvement with a detection speed of 23 fps. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10044132
Volume :
30
Issue :
1
Database :
Academic Search Index
Journal :
Journal of Systems Engineering & Electronics
Publication Type :
Periodical
Accession number :
136458223
Full Text :
https://doi.org/10.21629/JSEE.2019.01.01