Back to Search Start Over

ALNet: An adaptive channel attention network with local discrepancy perception for accurate indoor visual localization.

Authors :
Gao, Hongbo
Dai, Kun
Wang, Ke
Li, Ruifeng
Zhao, Lijun
Wu, Mengyuan
Source :
Expert Systems with Applications. Sep2024, Vol. 250, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

Visual localization, a fundamental component of several computer vision tasks, has been predominantly realized by scene coordinate regression (SCoRe) techniques. These methods leverage neural networks for scene coordinates prediction, followed by a PnP algorithm to recover the 6-DOF camera pose. However, similar image patches are prevalent in indoor scenes, which results in the extraction of comparable features for the regression of different scene coordinates. As a result, the localization accuracy is severely declined. In this work, we develop ALNet, a novel SCoRe method that incorporates a local discrepancy perception module (LDPM) and an adaptive channel attention module (ACAM) to address this challenge. For LDPM, our key insight lies in that scene attributes around different similar image patches are inconsistent. Technically, for each image patch, LDPM identifies a certain number of the most dissimilar patches around it and computes difference vectors to enrich its own features, thereby enabling the differentiation of similar image patches. Considering geometric attributes are beneficial for distinguishing similar patches while semantic context is conducive to encoding regression issues, integrating multi-level features is an effective approach to elevate the localization accuracy. Therefore, ACAM concatenates multi-level features together and leverages both average pooling and max pooling to generate reliable channel-wise weighting coefficient, thereby modeling the correlation among channels to integrate multi-level features effectively. Comprehensive experiments are conducted on mainstream indoor localization benchmarks and an actual environment, showing that ALNet achieves impressive performance. Source code and the experimental results video are available at https://github.com/DAMMONGAO/alnet. • We propose a SCoRe-based method that achieves excellent localization accuracy. • We propose a module to distinguish similar image patches to improve accuracy. • We propose an attention module to integrate multi-level features. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
250
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
177285696
Full Text :
https://doi.org/10.1016/j.eswa.2024.123792