Abstract
Dynamic scene classification has been extensively studied in computer vision due to its widespread applications. The key to dynamic scene classification lies in jointly characterizing spatial appearance and temporal dynamics to achieve informative representation, which remains an outstanding task in the literature. In this paper, we propose a unified framework to extract spatial and temporal features for dynamic scene representation. More specifically, we deploy two variants of deep convolutional neural networks to encode spatial appearance and short-term dynamics into short-term deep features (STDF). Based on STDF, we propose using the autoregressive moving average model to extract long-term frequency features (LTFF). By combining STDF and LTFF, we establish the long-short-term feature (LSTF) representations of dynamic scenes. The LSTF characterizes both spatial and temporal patterns of dynamic scenes for comprehensive and information representation that enables more accurate classification. Extensive experiments on three-dynamic scene classification benchmarks have shown that the proposed LSTF achieves high performance and substantially surpasses the state-of-the-art methods.
| Original language | English |
|---|---|
| Article number | 8331876 |
| Pages (from-to) | 1038-1047 |
| Number of pages | 10 |
| Journal | IEEE Transactions on Circuits and Systems for Video Technology |
| Volume | 29 |
| Issue number | 4 |
| DOIs | |
| State | Published - Apr 2019 |
Keywords
- Dynamic scene classification
- long term frequency feature
- long-short term feature
Fingerprint
Dive into the research topics of 'Long-short-term features for dynamic scene classification'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver