Video Action Recognition Based on Deeper Convolution Networks with Pair-Wise Frame Motion Concatenation

Yamin Han, Peng Zhang, Tao Zhuo, Wei Huang, Yanning Zhang

科研成果: 书/报告/会议事项章节会议稿件同行评审

7 引用 (Scopus)

摘要

Deep convolution networks based strategies have shown a remarkable performance in different recognition tasks. Unfortunately, in a variety of realistic scenarios, accurate and robust recognition is hard especially for the videos. Different challenges such as cluttered backgrounds or viewpoint change etc. may generate the problem like large intrinsic and extrinsic class variations. In addition, the problem of data deficiency could also make the designed model degrade during learning and update. Therefore, an effective way by incorporating the frame-wise motion into the learning model on-the-fly has become more and more attractive in contemporary video analysis studies. To overcome those limitations, in this work, we proposed a deeper convolution networks based approach with pairwise motion concatenation, which is named deep temporal convolutional networks. In this work, a temporal motion accumulation mechanism has been introduced as an effective data entry for the learning of convolution networks. Specifically, to handle the possible data deficiency, beneficial practices of transferring ResNet-101 weights and data variation augmentation are also utilized for the purpose of robust recognition. Experiments on challenging dataset UCF101 and ODAR dataset have verified a preferable performance when compared with other state-of-art works.

源语言英语
主期刊名Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017
出版商IEEE Computer Society
1226-1235
页数10
ISBN(电子版)9781538607336
DOI
出版状态已出版 - 22 8月 2017
活动30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017 - Honolulu, 美国
期限: 21 7月 201726 7月 2017

出版系列

姓名IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
2017-July
ISSN(印刷版)2160-7508
ISSN(电子版)2160-7516

会议

会议30th IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2017
国家/地区美国
Honolulu
时期21/07/1726/07/17

指纹

探究 'Video Action Recognition Based on Deeper Convolution Networks with Pair-Wise Frame Motion Concatenation' 的科研主题。它们共同构成独一无二的指纹。

引用此