Skip to main navigation Skip to search Skip to main content

Breakthrough in fine-grained video anomaly detection on highway: New benchmark and model

  • Chenlin Meng
  • , Xin Wang
  • , Chi Zhang
  • , Zhaoyong Mao
  • , Junge Shen
  • , Zhiyong Cheng
  • Northwestern Polytechnical University Xian
  • Shaanxi Transportation Holding Group
  • Hefei University of Technology

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Automatic detection of anomalies is critical for surveillance video analysis, especially for highway videos. In contrast to traditional coarse-grained methods, which only focus on identifying abnormal video clips, fine-grained detection more effectively meets practical demands. Because it can further classify the specific anomaly types which helps monitoring systems issue targeted alerts. Nevertheless, existing research often overlooks the semantic information in video descriptions, which is essential for capturing the fine contextual relationship between anomalies. Therefore, it is extremely necessary to explore how to utilize the description information to enhance fine-grained video anomaly detection. To achieve this, we construct the first Highway Anomaly Dataset (HAD) containing video descriptions and propose a novel multi-modal training paradigm called Dual-Classification with Dual-Text (DCDT). It integrates coarse-grained binary classification and fine-grained multi-class classification tasks, involving both description text and label text. In DCDT, we leverage video-description alignment to increase the intra-class similarity for normal events and inter-class differences among anomalies, while adopting video-label alignment to distinguish between normal and various types of anomalies. Experimental results demonstrate that DCDT achieves superior performance on our proposed HAD benchmark, as well as on the widely-used UCF-Crime and XD-Violence datasets. Furthermore, the HAD dataset provides an innovative platform for research in video analysis and multi-modal learning.

Original languageEnglish
Article number129127
JournalExpert Systems with Applications
Volume296
DOIs
StatePublished - 15 Jan 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 16 - Peace, Justice and Strong Institutions
    SDG 16 Peace, Justice and Strong Institutions

Keywords

  • Highway anomaly dataset
  • Multi-modal learning
  • Surveillance camera
  • Video analysis
  • Video anomaly detection

Fingerprint

Dive into the research topics of 'Breakthrough in fine-grained video anomaly detection on highway: New benchmark and model'. Together they form a unique fingerprint.

Cite this