Learning to detect anomaly events in crowd scenes from synthetic data

Wei Lin; Junyu Gao; Qi Wang; Xuelong Li

doi:10.1016/j.neucom.2021.01.031

School of Artificial Intelligence, OPtics and Electronics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

48 Scopus citations

Abstract

Recently, due to its widespread applications in public safety, anomaly detection in crowd scenes has become a hot topic. Some deep-learning-based methods attain significant achievements in this field. Nevertheless, most of them suffer from over-fitting to some extent because of scarce data, which are usually abrupt and low-frequency in the real world. To remedy the above problem, this paper firstly develops a synthetic anomaly event generating system, which could simulate typical specific abnormal events. By utilizing this system, a large synthetic, diverse anomaly event dataset is built, which contains 2,149 video sequences. After getting the dataset, a 3D CNN is designed to detect the abnormal types at the video level. However, we find that there are obvious domain differences (also named as “domain gap/shifts”) between synthetic videos and real-world data, which results in performance degradation when applying the model to the real world. Thus, this paper further proposes a cyclic 3D GAN for domain adaption to reduce the domain gap, which translates the synthetic data to the photorealistic video sequences. Then the detection model is trained on the translated data and it can perform well in the real data. Experimental results illustrate that the proposed method outperforms these baselines for the domain adaptation anomaly detection.

Original language	English
Pages (from-to)	248-259
Number of pages	12
Journal	Neurocomputing
Volume	436
DOIs	https://doi.org/10.1016/j.neucom.2021.01.031
State	Published - 14 May 2021

Keywords

Anomaly detection
Generative model
Human behavior analysis
Synthetic data
Video classification

Access to Document

10.1016/j.neucom.2021.01.031

Cite this

@article{8643b05a03f247788b08a79a8af2a64f,

title = "Learning to detect anomaly events in crowd scenes from synthetic data",

abstract = "Recently, due to its widespread applications in public safety, anomaly detection in crowd scenes has become a hot topic. Some deep-learning-based methods attain significant achievements in this field. Nevertheless, most of them suffer from over-fitting to some extent because of scarce data, which are usually abrupt and low-frequency in the real world. To remedy the above problem, this paper firstly develops a synthetic anomaly event generating system, which could simulate typical specific abnormal events. By utilizing this system, a large synthetic, diverse anomaly event dataset is built, which contains 2,149 video sequences. After getting the dataset, a 3D CNN is designed to detect the abnormal types at the video level. However, we find that there are obvious domain differences (also named as “domain gap/shifts”) between synthetic videos and real-world data, which results in performance degradation when applying the model to the real world. Thus, this paper further proposes a cyclic 3D GAN for domain adaption to reduce the domain gap, which translates the synthetic data to the photorealistic video sequences. Then the detection model is trained on the translated data and it can perform well in the real data. Experimental results illustrate that the proposed method outperforms these baselines for the domain adaptation anomaly detection.",

keywords = "Anomaly detection, Generative model, Human behavior analysis, Synthetic data, Video classification",

author = "Wei Lin and Junyu Gao and Qi Wang and Xuelong Li",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier B.V.",

year = "2021",

month = may,

day = "14",

doi = "10.1016/j.neucom.2021.01.031",

language = "English",

volume = "436",

pages = "248--259",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Learning to detect anomaly events in crowd scenes from synthetic data

AU - Lin, Wei

AU - Gao, Junyu

AU - Wang, Qi

AU - Li, Xuelong

PY - 2021/5/14

Y1 - 2021/5/14

N2 - Recently, due to its widespread applications in public safety, anomaly detection in crowd scenes has become a hot topic. Some deep-learning-based methods attain significant achievements in this field. Nevertheless, most of them suffer from over-fitting to some extent because of scarce data, which are usually abrupt and low-frequency in the real world. To remedy the above problem, this paper firstly develops a synthetic anomaly event generating system, which could simulate typical specific abnormal events. By utilizing this system, a large synthetic, diverse anomaly event dataset is built, which contains 2,149 video sequences. After getting the dataset, a 3D CNN is designed to detect the abnormal types at the video level. However, we find that there are obvious domain differences (also named as “domain gap/shifts”) between synthetic videos and real-world data, which results in performance degradation when applying the model to the real world. Thus, this paper further proposes a cyclic 3D GAN for domain adaption to reduce the domain gap, which translates the synthetic data to the photorealistic video sequences. Then the detection model is trained on the translated data and it can perform well in the real data. Experimental results illustrate that the proposed method outperforms these baselines for the domain adaptation anomaly detection.

AB - Recently, due to its widespread applications in public safety, anomaly detection in crowd scenes has become a hot topic. Some deep-learning-based methods attain significant achievements in this field. Nevertheless, most of them suffer from over-fitting to some extent because of scarce data, which are usually abrupt and low-frequency in the real world. To remedy the above problem, this paper firstly develops a synthetic anomaly event generating system, which could simulate typical specific abnormal events. By utilizing this system, a large synthetic, diverse anomaly event dataset is built, which contains 2,149 video sequences. After getting the dataset, a 3D CNN is designed to detect the abnormal types at the video level. However, we find that there are obvious domain differences (also named as “domain gap/shifts”) between synthetic videos and real-world data, which results in performance degradation when applying the model to the real world. Thus, this paper further proposes a cyclic 3D GAN for domain adaption to reduce the domain gap, which translates the synthetic data to the photorealistic video sequences. Then the detection model is trained on the translated data and it can perform well in the real data. Experimental results illustrate that the proposed method outperforms these baselines for the domain adaptation anomaly detection.

KW - Anomaly detection

KW - Generative model

KW - Human behavior analysis

KW - Synthetic data

KW - Video classification

UR - http://www.scopus.com/inward/record.url?scp=85100442169&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2021.01.031

DO - 10.1016/j.neucom.2021.01.031

M3 - Article

AN - SCOPUS:85100442169

SN - 0925-2312

VL - 436

SP - 248

EP - 259

JO - Neurocomputing

JF - Neurocomputing

ER -
