TY - GEN
T1 - A Novel 3D Convolutional Neural Network for Action Recognition in Infrared Videos
AU - Nie, Jiahao
AU - Yan, Longbin
AU - Wang, Xiuheng
AU - Chen, Jie
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021
Y1 - 2021
N2 - As Infrared (IR) imaging is sensitive to objects emitting heat, it is more useful to distinguish people from the background compared with visible spectrum imaging, especially in poor light enviornment. Recently, IR images have attracted increasing attention in action recognition, for which Convolutional Neural Networks (ConvNets) have achieved great success in both aspects of performance and speed. Compared to 2D ConvNets, 3D ConvNets are more powerful tools for action recognition which can jointly extract features from temporal and spatial domains and combine these together to enhance the performance. In this paper, we propose a novel 3D ConvNet with a deep architecture to realize action recognition in IR videos. Besides, a residual fully connected (FC) module is introduced after the ConvNet backbone to improve the performance. Furthermore, we employe a transfer learning strategy, i.e., the proposed 3D ConvNet is pretrained on a large-scale visible spectrum dataset and then finetuned with false-color version of IR images to generalize well in the action recognition task. Experimental results demonstrate the superiority of the proposed method in Average Precision comparisons.
AB - As Infrared (IR) imaging is sensitive to objects emitting heat, it is more useful to distinguish people from the background compared with visible spectrum imaging, especially in poor light enviornment. Recently, IR images have attracted increasing attention in action recognition, for which Convolutional Neural Networks (ConvNets) have achieved great success in both aspects of performance and speed. Compared to 2D ConvNets, 3D ConvNets are more powerful tools for action recognition which can jointly extract features from temporal and spatial domains and combine these together to enhance the performance. In this paper, we propose a novel 3D ConvNet with a deep architecture to realize action recognition in IR videos. Besides, a residual fully connected (FC) module is introduced after the ConvNet backbone to improve the performance. Furthermore, we employe a transfer learning strategy, i.e., the proposed 3D ConvNet is pretrained on a large-scale visible spectrum dataset and then finetuned with false-color version of IR images to generalize well in the action recognition task. Experimental results demonstrate the superiority of the proposed method in Average Precision comparisons.
KW - action recognition
KW - convolutional neural network
KW - infrared image
KW - transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85123166940&partnerID=8YFLogxK
U2 - 10.1109/ICICSP54369.2021.9611896
DO - 10.1109/ICICSP54369.2021.9611896
M3 - 会议稿件
AN - SCOPUS:85123166940
T3 - 2021 4th International Conference on Information Communication and Signal Processing, ICICSP 2021
SP - 420
EP - 424
BT - 2021 4th International Conference on Information Communication and Signal Processing, ICICSP 2021
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Conference on Information Communication and Signal Processing, ICICSP 2021
Y2 - 24 September 2021 through 26 September 2021
ER -