Back to Search Start Over

Cross-View Geo-Localization with Street-View and VHR Satellite Imagery in Decentrality Settings

Authors :
Xia, Panwang
Yu, Lei
Wan, Yi
Wu, Qiong
Chen, Peiqi
Zhong, Liheng
Yao, Yongxiang
Wei, Dong
Liu, Xinyi
Ru, Lixiang
Zhang, Yingying
Lao, Jiangwei
Chen, Jingdong
Yang, Ming
Zhang, Yongjun
Publication Year :
2024

Abstract

Cross-View Geo-Localization tackles the challenge of image geo-localization in GNSS-denied environments, including disaster response scenarios, urban canyons, and dense forests, by matching street-view query images with geo-tagged aerial-view reference images. However, current research often relies on benchmarks and methods that assume center-aligned settings or account for only limited decentrality, which we define as the offset of the query image relative to the reference image center. Such assumptions fail to reflect real-world scenarios, where reference databases are typically pre-established without the possibility of ensuring perfect alignment for each query image. Moreover, decentrality is a critical factor warranting deeper investigation, as larger decentrality can substantially improve localization efficiency but comes at the cost of declines in localization accuracy. To address this limitation, we introduce DReSS (Decentrality Related Street-view and Satellite-view dataset), a novel dataset designed to evaluate cross-view geo-localization with a large geographic scope and diverse landscapes, emphasizing the decentrality issue. Meanwhile, we propose AuxGeo (Auxiliary Enhanced Geo-Localization) to further study the decentrality issue, which leverages a multi-metric optimization strategy with two novel modules: the Bird's-eye view Intermediary Module (BIM) and the Position Constraint Module (PCM). These modules improve the localization accuracy despite the decentrality problem. Extensive experiments demonstrate that AuxGeo outperforms previous methods on our proposed DReSS dataset, mitigating the issue of large decentrality, and also achieves state-of-the-art performance on existing public datasets such as CVUSA, CVACT, and VIGOR.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2412.11529
Document Type :
Working Paper