TY - JOUR
T1 - Learning to detect anomaly events in crowd scenes from synthetic data
AU - Lin, Wei
AU - Gao, Junyu
AU - Wang, Qi
AU - Li, Xuelong
N1 - Publisher Copyright: © 2021 Elsevier B.V.
PY - 2021/5/14
Y1 - 2021/5/14
N2 - Anomaly detection in crowd scenes has recently become a hot topic because of its widespread applications in public safety. Some deep-learning-based methods have achieved significant results in this field. Nevertheless, most of them suffer from over-fitting to some extent because data are scarce: anomalous events are usually abrupt and infrequent in the real world. To remedy this problem, this paper first develops a synthetic anomaly event generation system that can simulate specific, typical abnormal events. Using this system, a large, diverse synthetic anomaly event dataset containing 2,149 video sequences is built. A 3D CNN is then designed to detect the abnormal types at the video level. However, we find obvious domain differences (also called “domain gap/shift”) between synthetic videos and real-world data, which degrade performance when the model is applied to the real world. Thus, this paper further proposes a cyclic 3D GAN for domain adaptation, which reduces the domain gap by translating the synthetic data into photorealistic video sequences. The detection model is then trained on the translated data and performs well on real data. Experimental results show that the proposed method outperforms the baselines for domain-adaptation anomaly detection.
AB - Anomaly detection in crowd scenes has recently become a hot topic because of its widespread applications in public safety. Some deep-learning-based methods have achieved significant results in this field. Nevertheless, most of them suffer from over-fitting to some extent because data are scarce: anomalous events are usually abrupt and infrequent in the real world. To remedy this problem, this paper first develops a synthetic anomaly event generation system that can simulate specific, typical abnormal events. Using this system, a large, diverse synthetic anomaly event dataset containing 2,149 video sequences is built. A 3D CNN is then designed to detect the abnormal types at the video level. However, we find obvious domain differences (also called “domain gap/shift”) between synthetic videos and real-world data, which degrade performance when the model is applied to the real world. Thus, this paper further proposes a cyclic 3D GAN for domain adaptation, which reduces the domain gap by translating the synthetic data into photorealistic video sequences. The detection model is then trained on the translated data and performs well on real data. Experimental results show that the proposed method outperforms the baselines for domain-adaptation anomaly detection.
KW - Anomaly detection
KW - Generative model
KW - Human behavior analysis
KW - Synthetic data
KW - Video classification
UR - http://www.scopus.com/inward/record.url?scp=85100442169&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2021.01.031
DO - 10.1016/j.neucom.2021.01.031
M3 - Article
AN - SCOPUS:85100442169
SN - 0925-2312
VL - 436
SP - 248
EP - 259
JO - Neurocomputing
JF - Neurocomputing
ER -