Text-based Person Search in Full Images via Semantic-Driven Proposal Generation

Shizhou Zhang, De Cheng, Wenlong Luo, Yinghui Xing, Duo Long, Hao Li, Kai Niu, Guoqiang Liang, Yanning Zhang

科研成果: 书/报告/会议事项章节会议稿件同行评审

5 引用 (Scopus)

摘要

Finding target persons in full scene images with a query of text description has important practical applications in intelligent video surveillance. However, different from the real-world scenarios where the bounding boxes are not available, existing text-based person re- trieval methods mainly focus on the cross modal matching between the query text descriptions and the gallery of cropped pedestrian images. To close the gap, we study the problem of text-based person search in full images by proposing a new end-to-end learning framework which jointly optimize the pedestrian detection, identification and visual-semantic feature embedding tasks. To take full advantage of the query text, the semantic features are leveraged to instruct the Region Proposal Network to pay more attention to the text-described proposals. Besides, a cross-scale visual-semantic embedding mechanism is utilized to improve the performance. To validate the proposed method, we collect and annotate two large-scale benchmark datasets based on the widely adopted image-based person search datasets CUHK-SYSU and PRW. Comprehensive experiments are conducted on the two datasets and compared with the baseline methods, our method achieves the state-of-the-art performance.

源语言英语
主期刊名HCMA 2023 - Proceedings of the 4th International Workshop on Human-centric Multimedia Analysis, Co-located with
主期刊副标题MM 2023
出版商Association for Computing Machinery, Inc
5-14
页数10
ISBN(电子版)9798400702723
DOI
出版状态已出版 - 2 11月 2023
活动4th International Workshop on Human-centric Multimedia Analysis, HCMA 2023 - Ottawa, 加拿大
期限: 2 11月 2023 → …

出版系列

姓名HCMA 2023 - Proceedings of the 4th International Workshop on Human-centric Multimedia Analysis, Co-located with: MM 2023

会议

会议4th International Workshop on Human-centric Multimedia Analysis, HCMA 2023
国家/地区加拿大
Ottawa
时期2/11/23 → …

指纹

探究 'Text-based Person Search in Full Images via Semantic-Driven Proposal Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此