TY - GEN
T1 - Enhancing Weakly Supervised Anomaly Detection in Surveillance Videos
T2 - 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024
AU - Wu, Yinglong
AU - Mao, Zhaoyong
AU - Yu, Chenyang
AU - Liu, Guanglin
AU - Shen, Junge
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Aiming at the challenges of surveillance video anomaly detection(SVAD),especially the diversity and openness of its event types, we propose CLIP-Augmented Bimodal Memory Enhanced Network for weakly-supervised surveillance video anomaly detection. Specifically, we design a video feature extraction module based on CLIP feature, which significantly improves the ability to capture the semantic content of surveillance videos. Given the problem of semantic diversity of abnormal events, we further design a Bimodal Memory Unit(BMMU), which is used to enhance the model for all types of abnormal events by means of two kinds of memory module, storing the visual features and the textual descriptive features, in order to enhance the model's ability to remember and distinguish various types of anomalous features. Extensive experiments demonstrate that our approach achieves state-of-the-art performance on the UCF-Crime and XD-Violence benchmark datasets.
AB - Aiming at the challenges of surveillance video anomaly detection(SVAD),especially the diversity and openness of its event types, we propose CLIP-Augmented Bimodal Memory Enhanced Network for weakly-supervised surveillance video anomaly detection. Specifically, we design a video feature extraction module based on CLIP feature, which significantly improves the ability to capture the semantic content of surveillance videos. Given the problem of semantic diversity of abnormal events, we further design a Bimodal Memory Unit(BMMU), which is used to enhance the model for all types of abnormal events by means of two kinds of memory module, storing the visual features and the textual descriptive features, in order to enhance the model's ability to remember and distinguish various types of anomalous features. Extensive experiments demonstrate that our approach achieves state-of-the-art performance on the UCF-Crime and XD-Violence benchmark datasets.
UR - http://www.scopus.com/inward/record.url?scp=85217417005&partnerID=8YFLogxK
U2 - 10.1109/ICARCV63323.2024.10821687
DO - 10.1109/ICARCV63323.2024.10821687
M3 - 会议稿件
AN - SCOPUS:85217417005
T3 - 2024 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024
SP - 756
EP - 762
BT - 2024 18th International Conference on Control, Automation, Robotics and Vision, ICARCV 2024
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 December 2024 through 15 December 2024
ER -