TY - JOUR
T1 - Long-short-term features for dynamic scene classification
AU - Huang, Yuanjun
AU - Cao, Xianbin
AU - Wang, Qi
AU - Zhang, Baochang
AU - Zhen, Xiantong
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2019/4
Y1 - 2019/4
N2 - Dynamic scene classification has been extensively studied in computer vision due to its widespread applications. The key to dynamic scene classification lies in jointly characterizing spatial appearance and temporal dynamics to achieve informative representation, which remains an outstanding task in the literature. In this paper, we propose a unified framework to extract spatial and temporal features for dynamic scene representation. More specifically, we deploy two variants of deep convolutional neural networks to encode spatial appearance and short-term dynamics into short-term deep features (STDF). Based on STDF, we propose using the autoregressive moving average model to extract long-term frequency features (LTFF). By combining STDF and LTFF, we establish the long-short-term feature (LSTF) representations of dynamic scenes. The LSTF characterizes both spatial and temporal patterns of dynamic scenes for comprehensive and information representation that enables more accurate classification. Extensive experiments on three-dynamic scene classification benchmarks have shown that the proposed LSTF achieves high performance and substantially surpasses the state-of-the-art methods.
AB - Dynamic scene classification has been extensively studied in computer vision due to its widespread applications. The key to dynamic scene classification lies in jointly characterizing spatial appearance and temporal dynamics to achieve informative representation, which remains an outstanding task in the literature. In this paper, we propose a unified framework to extract spatial and temporal features for dynamic scene representation. More specifically, we deploy two variants of deep convolutional neural networks to encode spatial appearance and short-term dynamics into short-term deep features (STDF). Based on STDF, we propose using the autoregressive moving average model to extract long-term frequency features (LTFF). By combining STDF and LTFF, we establish the long-short-term feature (LSTF) representations of dynamic scenes. The LSTF characterizes both spatial and temporal patterns of dynamic scenes for comprehensive and information representation that enables more accurate classification. Extensive experiments on three-dynamic scene classification benchmarks have shown that the proposed LSTF achieves high performance and substantially surpasses the state-of-the-art methods.
KW - Dynamic scene classification
KW - long term frequency feature
KW - long-short term feature
UR - http://www.scopus.com/inward/record.url?scp=85045210813&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2018.2823360
DO - 10.1109/TCSVT.2018.2823360
M3 - 文章
AN - SCOPUS:85045210813
SN - 1051-8215
VL - 29
SP - 1038
EP - 1047
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 4
M1 - 8331876
ER -