Long-short-term features for dynamic scene classification

Yuanjun Huang, Xianbin Cao, Qi Wang, Baochang Zhang, Xiantong Zhen, Xuelong Li

Research output: Contribution to journalArticlepeer-review

24 Scopus citations

Abstract

Dynamic scene classification has been extensively studied in computer vision due to its widespread applications. The key to dynamic scene classification lies in jointly characterizing spatial appearance and temporal dynamics to achieve informative representation, which remains an outstanding task in the literature. In this paper, we propose a unified framework to extract spatial and temporal features for dynamic scene representation. More specifically, we deploy two variants of deep convolutional neural networks to encode spatial appearance and short-term dynamics into short-term deep features (STDF). Based on STDF, we propose using the autoregressive moving average model to extract long-term frequency features (LTFF). By combining STDF and LTFF, we establish the long-short-term feature (LSTF) representations of dynamic scenes. The LSTF characterizes both spatial and temporal patterns of dynamic scenes for comprehensive and information representation that enables more accurate classification. Extensive experiments on three-dynamic scene classification benchmarks have shown that the proposed LSTF achieves high performance and substantially surpasses the state-of-the-art methods.

Original languageEnglish
Article number8331876
Pages (from-to)1038-1047
Number of pages10
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume29
Issue number4
DOIs
StatePublished - Apr 2019

Keywords

  • Dynamic scene classification
  • long term frequency feature
  • long-short term feature

Fingerprint

Dive into the research topics of 'Long-short-term features for dynamic scene classification'. Together they form a unique fingerprint.

Cite this