跳到主要导航 跳到搜索 跳到主要内容

Enhancing Visible-Infrared Person Re-identification with Modality- and Instance-aware Visual Prompt Learning

  • Ruiqi Wu
  • , Bingliang Jiao
  • , Wenxuan Wang
  • , Meng Liu
  • , Peng Wang
  • Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节会议稿件同行评审

15 引用 (Scopus)

摘要

The Visible-Infrared Person Re-identification (VI ReID) aims to match visible and infrared images of the same pedestrians across non-overlapped camera views. These two input modalities contain both invariant information, such as shape, and modality-specific details, such as color. An ideal model should utilize valuable information from both modalities during training for enhanced representational capability. However, the gap caused by modality-specific information poses substantial challenges for the VI ReID model to handle distinct modality inputs simultaneously. To address this, we introduce the Modality-aware and Instance-aware Visual Prompts (MIP) network in our work, designed to effectively utilize both invariant and specific information for identification. Specifically, our MIP model is built on the transformer architecture. In this model, we have designed a series of modality-specific prompts, which could enable our model to adapt to and make use of the specific information inherent in different modality inputs, thereby reducing the interference caused by the modality gap and achieving better identification. Besides, we also employ each pedestrian feature to construct a group of instance-specific prompts. These customized prompts are responsible for guiding our model to adapt to each pedestrian instance dynamically, thereby capturing identity-level discriminative clues for identification. Through extensive experiments on SYSU-MM01 and RegDB datasets, the effectiveness of both our designed modules is evaluated. Additionally, our proposed MIP performs better than most state-of-the-art methods.

源语言英语
主期刊名ICMR 2024-Proceedings of the 14th Annual ACM International Conference on Multimedia Retrieval
出版商Association for Computing Machinery, Inc
579-588
页数10
ISBN(电子版)9798400706028
DOI
出版状态已出版 - 7 6月 2024
已对外发布
活动14th Annual ACM International Conference on Multimedia Retrieval, ICMR 2024 - Phuket, 泰国
期限: 10 6月 202414 6月 2024

出版系列

姓名ICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval

会议

会议14th Annual ACM International Conference on Multimedia Retrieval, ICMR 2024
国家/地区泰国
Phuket
时期10/06/2414/06/24

指纹

探究 'Enhancing Visible-Infrared Person Re-identification with Modality- and Instance-aware Visual Prompt Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此