基于双分支特征聚合的无人机视觉位置识别

Translated title of the contribution: Dual-branch feature aggregation for UAV visual place recognition
  • Qi Liu
  • , Zhixiang Pei
  • , Le Hui
  • , Mingyi He
  • , Yuchao Dai

Research output: Contribution to journalArticlepeer-review

Abstract

UAVs’ reliance on Global Navigation Satellite Systems (GNSS( for navigation and positioning is prone to failure due to signal blockage or interference. Visual Place Recognition (VPR( enables geographic localization by matching the visual information captured by UAVs with pre-built map data, providing reliable positioning information in GNSS-denied environments, and thus become a research hotspot in recent years. Traditional VPR methods typically depend on pre-trained networks to extract global features for matching and retrieval, but they are sensitive to changes in visual appearance such as viewpoint, scale, and lighting, and are prone to losing fine-grained information. To address these issues, this paper proposes a UAV visual geo-localization method based on a dual-branch feature aggregation network that combines a pre-trained Vision Transformer model and a state-space model to extract more robust features. Specifically, a dual-branch feature extraction network integrating the DINOv2 and VMamba models is designed, which leverages the global semantic understanding of ViT and the local dynamic modeling capability of the visual state-space model to achieve stronger generalization and fine-grained perception. Additionally, the method introduces an efficient feature fusion framework inspired by the MLP-Mixer architecture to enhance the performance of multichannel feature representation. Experiments conducted on the same-view ALTO dataset and the cross-view VIGOR dataset demonstrate that the proposed method achieves high accuracy in metrics such as R@1 and R@5, outperforming existing methods. This method is proved effective in identifying matching images in different scenarios.

Translated title of the contributionDual-branch feature aggregation for UAV visual place recognition
Original languageChinese (Traditional)
Pages (from-to)1-12
Number of pages12
JournalHangkong Xuebao/Acta Aeronautica et Astronautica Sinica
Volume46
Issue number23
DOIs
StatePublished - 2025

Fingerprint

Dive into the research topics of 'Dual-branch feature aggregation for UAV visual place recognition'. Together they form a unique fingerprint.

Cite this