Exploiting a Spatial Attention Mechanism for Improved Depth Completion and Feature Fusion in Novel View Synthesis

Authors :
Anh Minh Truong
Wilfried Philips
Peter Veelaert
Source :
IEEE Open Journal of Signal Processing, Vol 5, Pp 204-212 (2024)
Publication Year :
2024
Publisher :
IEEE, 2024.

Abstract

Many image-based rendering (IBR) methods rely on depth estimates obtained from structured light or time-of-flight depth sensors to synthesize novel views from sparse camera networks. However, these estimates often contain missing or noisy regions, resulting in an incorrect mapping between source and target views. This makes the fusion process more challenging, as the visual information is misaligned, inconsistent, or missing. In this work, we first implement a lightweight transformer-based network to extract spatial features from color images; transformers are well known for their capability to model long-range relationships within input data. These features are then used to enhance the quality of completed depth maps. Furthermore, we combine a sequential deep neural network with a spatial attention mechanism to effectively fuse the projected features from multiple source viewpoints. This approach enables us to integrate information from an arbitrary number of source viewpoints and to improve accuracy in synthesized views. Experimental results on challenging datasets demonstrate that our method achieves superior synthesized image quality compared to state-of-the-art (SOTA) methods.
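
The abstract does not give implementation details, so the following is only a minimal illustrative sketch of the general idea of attention-weighted fusion across a variable number of source views; it is not the authors' network. It assumes per-pixel relevance scores are already available (in a real system a learned network would produce them), and the names `fuse_views`, `features`, `scores`, and `valid` are hypothetical.

```python
import numpy as np

def fuse_views(features, scores, valid=None):
    """Fuse per-view feature maps with per-pixel softmax weights.

    features: array of shape (V, H, W, C), features projected from V source views
    scores:   array of shape (V, H, W), per-pixel relevance logits (assumed given)
    valid:    optional boolean array of shape (V, H, W); False marks pixels
              with missing projections (e.g. holes in the depth map)
    Returns an array of shape (H, W, C).
    """
    features = np.asarray(features, dtype=np.float64)
    scores = np.asarray(scores, dtype=np.float64)
    if valid is not None:
        # Push invalid pixels to a very low score so they get ~zero weight.
        # If every view is invalid at a pixel, the weights become uniform.
        scores = np.where(valid, scores, -1e9)
    # Numerically stable softmax over the view axis; works for any V >= 1.
    shifted = scores - scores.max(axis=0, keepdims=True)
    weights = np.exp(shifted)
    weights /= weights.sum(axis=0, keepdims=True)
    # Weighted sum of the per-view features at every pixel.
    return (weights[..., None] * features).sum(axis=0)
```

Because the softmax is taken over the view axis, the same function accepts two, three, or more source views without any change, which mirrors the abstract's point about integrating an arbitrary number of viewpoints.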

Details

Language :
English
ISSN :
2644-1322
Volume :
5
Database :
Directory of Open Access Journals
Journal :
IEEE Open Journal of Signal Processing
Publication Type :
Academic Journal
Accession number :
edsdoj.2e1aa1810db0457a829ee68aeb45e168
Document Type :
article
Full Text :
https://doi.org/10.1109/OJSP.2023.3340064