A Flow Base Bi-path Network for Cross-Scene Video Crowd Understanding in Aerial View

Zhiyuan Zhao, Tao Han, Junyu Gao, Qi Wang, Xuelong Li

科研成果: 书/报告/会议事项章节会议稿件同行评审

11 引用 (Scopus)

摘要

Drones shooting can be applied in dynamic traffic monitoring, object detecting and tracking, and other vision tasks. The variability of the shooting location adds some intractable challenges to these missions, such as varying scale, unstable exposure, and scene migration. In this paper, we strive to tackle the above challenges and automatically understand the crowd from the visual data collected from drones. First, to alleviate the background noise generated in cross-scene testing, a double-stream crowd counting model is proposed, which extracts optical flow and frame difference information as an additional branch. Besides, to improve the model’s generalization ability at different scales and time, we randomly combine a variety of data transformation methods to simulate some unseen environments. To tackle the crowd density estimation problem under extreme dark environments, we introduce synthetic data generated by game Grand Theft Auto V(GTAV). Experiment results show the effectiveness of the virtual data. Our method wins the challenge with a mean absolute error (MAE) of 12.701. Moreover, a comprehensive ablation study is conducted to explore each component’s contribution.

源语言英语
主期刊名Computer Vision – ECCV 2020 Workshops, Proceedings
编辑Adrien Bartoli, Andrea Fusiello
出版商Springer Science and Business Media Deutschland GmbH
574-587
页数14
ISBN(印刷版)9783030668228
DOI
出版状态已出版 - 2020
活动Workshops held at the 16th European Conference on Computer Vision, ECCV 2020 - Glasgow, 英国
期限: 23 8月 202028 8月 2020

出版系列

姓名Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
12538 LNCS
ISSN(印刷版)0302-9743
ISSN(电子版)1611-3349

会议

会议Workshops held at the 16th European Conference on Computer Vision, ECCV 2020
国家/地区英国
Glasgow
时期23/08/2028/08/20

指纹

探究 'A Flow Base Bi-path Network for Cross-Scene Video Crowd Understanding in Aerial View' 的科研主题。它们共同构成独一无二的指纹。

引用此