跳到主要导航 跳到搜索 跳到主要内容

Visual State Space Model Enhanced Features for UAV Geo-localization

  • Qi Liu
  • , Zhixiang Pei
  • , Yu Zhou
  • , Le Hui
  • , Yuchao Dai
  • , Mingyi He
  • Northwestern Polytechnical University Xian
  • Xi'an Institute of Surveying and Mapping

科研成果: 期刊稿件会议文章同行评审

摘要

The reliable operation of UAVs in complex environments relies on a strong positioning system, as conventional GNSS is susceptible to failure when signals are blocked or disrupted. As a result, Visual Place Recognition (VPR) has become a key technology for UAV localization. By matching the visual information captured by UAVs with the prebuilt satellite map database, UAVs can achieve geographical positioning. Traditional methods that rely on pre-trained networks for extracting global features for matching and retrieval are typically sensitive to visual appearance variations and prone to losing fine-grained information. To address this issue, we propose a UAV visual geolocalization method based on a dual-branch network, combining a pre-trained vision transformer model and a visual state space model to extract more robust features. Specifically, we design a dual-branch feature extraction network that integrates the DINOv2 and Mamba models to overcome challenges posed in appearance changes. It leverages the complementary strengths of both models to improving visual localization performance by combining global and local features. Additionally, we introduce an efficient, robust feature fusion framework inspired by the MLP-Mixer architecture to enhance the performance of multi-source feature representations. Experimental results on the ALTO and NewYorkFly datasets demonstrate that the proposed method outperforms existing methods in metrics such as R@1 and R@5. Notably, on the NewYorkFly dataset, R@1 improves by 6.3%. These results highlight the significant advantages of our method in UAV visual geo-localization tasks.

源语言英语
页(从-至)5628-5632
页数5
期刊International Geoscience and Remote Sensing Symposium (IGARSS)
DOI
出版状态已出版 - 2025
活动2025 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2025 - Brisbane, 澳大利亚
期限: 3 8月 20258 8月 2025

指纹

探究 'Visual State Space Model Enhanced Features for UAV Geo-localization' 的科研主题。它们共同构成独一无二的指纹。

引用此