Back to Search Start Over

Indoor scene understanding via RGB-D image segmentation employing depth-based CNN and CRFs.

Authors :
Li, Wei
Gu, Junhua
Dong, Yongfeng
Dong, Yao
Han, Jungong
Source :
Multimedia Tools & Applications; 2020, Vol. 79 Issue 47/48, p35475-35489, 15p
Publication Year :
2020

Abstract

With the availability of low-cost depth-visual sensing devices, such as Microsoft Kinect, we are experiencing a growing interest in indoor environment understanding, at the core of which is semantic segmentation in RGB-D image. The latest research shows that the convolutional neural network (CNN) still dominates the image semantic segmentation field. However, down-sampling operated during the training process of CNNs leads to unclear segmentation boundaries and poor classification accuracy. To address this problem, in this paper, we propose a novel end-to-end deep architecture, termed FuseCRFNet, which seamlessly incorporates a fully-connected Conditional Random Fields (CRFs) model into a depth-based CNN framework. The proposed segmentation method uses the properties of pixel-to-pixel relationships to increase the accuracy of image semantic segmentation. More importantly, we formulate the CRF as one of the layers in FuseCRFNet to refine the coarse segmentation in the forward propagation, in meanwhile, it passes back the errors to facilitate the training. The performance of our FuseCRFNet is evaluated by experimenting with SUN RGB-D dataset, and the results show that the proposed algorithm is superior to existing semantic segmentation algorithms with an improvement in accuracy of at least 2%, further verifying the effectiveness of the algorithm. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13807501
Volume :
79
Issue :
47/48
Database :
Complementary Index
Journal :
Multimedia Tools & Applications
Publication Type :
Academic Journal
Accession number :
147410964
Full Text :
https://doi.org/10.1007/s11042-019-07882-w