MRRNet: Learning multiple region representation for video person re-identification

Hui Fu, Ke Zhang, Haoyu Li, Jingyu Wang

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Video person re-identification is a crucial component of a robust surveillance system. Within a video clip, different human regions exhibit unique stability characteristics, which would be harmful to generating a discriminative representation. Unfortunately, prior works cannot effectively deal with the stability characteristics of different regions. To tackle this problem, we propose a Multiple Region Representation Network (MRRNet) that aims to discover the discriminative information from different human regions. Firstly, a Stable Region Representation (SRR) layer is proposed to capture important clues from the stable regions and exchange temporal information by cross-relation aware operation. Secondly, a Multiple Region Representation (MRR) layer is designed to address the unstable regions and preserve the attention on stable regions. Thirdly, SRR and MRR can be conveniently inserted into multiple stages of the deep residual networks and significantly improve the performance of the network. Comprehensive experiments validate the effectiveness of our network. Particularly, MRRNet achieves 86.7% mAP and 91.1% Rank-1 accuracy on the MARS dataset, which outperforms state-of-the-arts.

Original languageEnglish
Article number105108
JournalEngineering Applications of Artificial Intelligence
Volume114
DOIs
StatePublished - Sep 2022

Keywords

  • Cross-relation aware
  • Multiple region representation
  • Self-relation aware
  • Stable region representation
  • Video person re-identification

Fingerprint

Dive into the research topics of 'MRRNet: Learning multiple region representation for video person re-identification'. Together they form a unique fingerprint.

Cite this