Human-Machine Cooperative Video Anomaly Detection

Fan Yang; Zhiwen Yu; Liming Chen; Jiaxi Gu; Qingyang Li; Bin Guo

doi:10.1145/3434183

Human-Machine Cooperative Video Anomaly Detection

Fan Yang, Zhiwen Yu, Liming Chen, Jiaxi Gu, Qingyang Li, Bin Guo

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

22 引用（Scopus）

摘要

It is still a challenge to detect anomalous events in video sequences in the field of computer vision due to heavy object occlusions, varying crowded densities and complex situations. To address this, we propose a novel human-machine cooperative approach which uses human feedback on anomaly confirmation to inform and enhance video anomaly detection. Specifically, we analyze the spatio-temporal characteristics of sequential frames of a video from the appearance and motion perspective from which spatial and temporal features are identified and extracted. We then develop a convolutional autoencoder neural network to compute an abnormal score based on reconstruction errors. In this process, a group of experts will provide human feedback to a certain proportion of classified frames to be incorporated into the model, and also the final judgment for the event anomalies for training and classification. The proposed approach is evaluated on 3 publicly available surveillance datasets, showing improved accuracy and competitive performance (93.7% AUC) with respect to the best performance (90.6% AUC) of the state-of-the-art approaches. The approach has not been previously seen to the best of our knowledge.

源语言	英语
文章编号	274
期刊	Proceedings of the ACM on Human-Computer Interaction
卷	4
期	CSCW3
DOI	https://doi.org/10.1145/3434183
出版状态	已出版 - 5 1月 2021

访问文件

10.1145/3434183

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{cc09b9b7a3a84260b127ef7b0c09cae4,

title = "Human-Machine Cooperative Video Anomaly Detection",

abstract = "It is still a challenge to detect anomalous events in video sequences in the field of computer vision due to heavy object occlusions, varying crowded densities and complex situations. To address this, we propose a novel human-machine cooperative approach which uses human feedback on anomaly confirmation to inform and enhance video anomaly detection. Specifically, we analyze the spatio-temporal characteristics of sequential frames of a video from the appearance and motion perspective from which spatial and temporal features are identified and extracted. We then develop a convolutional autoencoder neural network to compute an abnormal score based on reconstruction errors. In this process, a group of experts will provide human feedback to a certain proportion of classified frames to be incorporated into the model, and also the final judgment for the event anomalies for training and classification. The proposed approach is evaluated on 3 publicly available surveillance datasets, showing improved accuracy and competitive performance (93.7% AUC) with respect to the best performance (90.6% AUC) of the state-of-the-art approaches. The approach has not been previously seen to the best of our knowledge.",

keywords = "anomaly detection, autoencoder, human-machine, video frame",

author = "Fan Yang and Zhiwen Yu and Liming Chen and Jiaxi Gu and Qingyang Li and Bin Guo",

note = "Publisher Copyright: {\textcopyright} 2021 ACM.",

year = "2021",

month = jan,

day = "5",

doi = "10.1145/3434183",

language = "英语",

volume = "4",

journal = "Proceedings of the ACM on Human-Computer Interaction",

issn = "2573-0142",

publisher = "Association for Computing Machinery",

number = "CSCW3",

}

TY - JOUR

T1 - Human-Machine Cooperative Video Anomaly Detection

AU - Yang, Fan

AU - Yu, Zhiwen

AU - Chen, Liming

AU - Gu, Jiaxi

AU - Li, Qingyang

AU - Guo, Bin

PY - 2021/1/5

Y1 - 2021/1/5

N2 - It is still a challenge to detect anomalous events in video sequences in the field of computer vision due to heavy object occlusions, varying crowded densities and complex situations. To address this, we propose a novel human-machine cooperative approach which uses human feedback on anomaly confirmation to inform and enhance video anomaly detection. Specifically, we analyze the spatio-temporal characteristics of sequential frames of a video from the appearance and motion perspective from which spatial and temporal features are identified and extracted. We then develop a convolutional autoencoder neural network to compute an abnormal score based on reconstruction errors. In this process, a group of experts will provide human feedback to a certain proportion of classified frames to be incorporated into the model, and also the final judgment for the event anomalies for training and classification. The proposed approach is evaluated on 3 publicly available surveillance datasets, showing improved accuracy and competitive performance (93.7% AUC) with respect to the best performance (90.6% AUC) of the state-of-the-art approaches. The approach has not been previously seen to the best of our knowledge.

AB - It is still a challenge to detect anomalous events in video sequences in the field of computer vision due to heavy object occlusions, varying crowded densities and complex situations. To address this, we propose a novel human-machine cooperative approach which uses human feedback on anomaly confirmation to inform and enhance video anomaly detection. Specifically, we analyze the spatio-temporal characteristics of sequential frames of a video from the appearance and motion perspective from which spatial and temporal features are identified and extracted. We then develop a convolutional autoencoder neural network to compute an abnormal score based on reconstruction errors. In this process, a group of experts will provide human feedback to a certain proportion of classified frames to be incorporated into the model, and also the final judgment for the event anomalies for training and classification. The proposed approach is evaluated on 3 publicly available surveillance datasets, showing improved accuracy and competitive performance (93.7% AUC) with respect to the best performance (90.6% AUC) of the state-of-the-art approaches. The approach has not been previously seen to the best of our knowledge.

KW - anomaly detection

KW - autoencoder

KW - human-machine

KW - video frame

UR - http://www.scopus.com/inward/record.url?scp=85175639498&partnerID=8YFLogxK

U2 - 10.1145/3434183

DO - 10.1145/3434183

M3 - 文章

AN - SCOPUS:85175639498

SN - 2573-0142

VL - 4

JO - Proceedings of the ACM on Human-Computer Interaction

JF - Proceedings of the ACM on Human-Computer Interaction

IS - CSCW3

M1 - 274

ER -

Human-Machine Cooperative Video Anomaly Detection

摘要

访问文件

其它文件与链接

指纹

引用此