跳到主要导航 跳到搜索 跳到主要内容

Transformer-Based Few-Shot Object Detection with Multi-Relation Matching for Remote Sensing Images

  • Lefan Wang
  • , Jiawei Lian
  • , Yan Feng
  • , Xiaoning Chen
  • , Shaohui Mei
  • Northwestern Polytechnical University Xian

科研成果: 会议稿件论文同行评审

2 引用 (Scopus)

摘要

Few-shot object detection (FSOD) on remote sensing images (RSIs) has garnered significant research interest due to its ability to detect novel classes using very few training examples from challenging remote sensing scenarios. Meta-learning FSOD methods, based on Faster R-CNN and YOLO structures, utilize a two-branch Siamese network as the backbone and compute the similarity between image regions for effective detection. However, almost all methods rely on extracting features using convolutional neural networks (CNNs). Inspired by the improved performance of transformer backbones for downstream tasks, a transformer-based FSOD method is proposed, which employs a transformer backbone with asymmetric-batched cross-attention for the two-branch feature extraction. Our model can improve the classification performance by introducing a Multi-Relation Matching (MRM) head for FSOD to enhance the similarity relation matching learning between two branches. Comprehensive experiments on DIOR benchmarks demonstrate the effectiveness of our model.

源语言英语
8046-8049
页数4
DOI
出版状态已出版 - 2024
活动2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024 - Athens, 希腊
期限: 7 7月 202412 7月 2024

会议

会议2024 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2024
国家/地区希腊
Athens
时期7/07/2412/07/24

指纹

探究 'Transformer-Based Few-Shot Object Detection with Multi-Relation Matching for Remote Sensing Images' 的科研主题。它们共同构成独一无二的指纹。

引用此