Back to Search
Start Over
Recovering 6D object pose from RGB indoor image based on two-stage detection network with multi-task loss.
- Source :
-
Neurocomputing . Apr2019, Vol. 337, p15-23. 9p. - Publication Year :
- 2019
-
Abstract
- Abstract Object pose estimation from an RGB image has recently become a common problem owing to its widespread application. The advent of convolutional neural networks has impelled significant progress in object detection. However, most available methods do not involve a category-level pose estimation approach. This study presents an end-to-end 6D category-level pose estimation based on a two-stage bounding-box recognition backbone architecture. Our network directly outputs the 6D pose without requiring multiple stages or additional post-processing such as a Perspective-n-Point (PnP). The two-stage CNN architecture and our loss function render multi-task joint training effective and efficient. We improve the pose estimation accuracy by replacing fully connected layers with fully convolutional layers. Fully convolutional networks require fewer parameters and are less susceptible to overfitting. Moreover, we transform the pose estimation problem into classification and regression tasks using our network; these are called Pose-cls and Pose-reg, respectively. We also present qualitative and quantitative results on real data from the SUN RGB-D dataset. The experiments demonstrate the effectiveness of our algorithms compared to other state-of-the-art methods. [ABSTRACT FROM AUTHOR]
- Subjects :
- *POSE estimation (Computer vision)
*IMAGE
*ARTIFICIAL neural networks
Subjects
Details
- Language :
- English
- ISSN :
- 09252312
- Volume :
- 337
- Database :
- Academic Search Index
- Journal :
- Neurocomputing
- Publication Type :
- Academic Journal
- Accession number :
- 135351436
- Full Text :
- https://doi.org/10.1016/j.neucom.2018.12.061