Beyond Vision: A Semantic Reasoning Enhanced Model for Gesture Recognition with Improved Spatiotemporal Capacity

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

3 Citations (Scopus)

Abstract

Gesture recognition is an imperative and practical problem owing to its great application potential. Although recent works have made great progress in this field, there also exist three non-negligible problems: 1) existing works lack efficient temporal modeling ability; 2) existing works lack effective spatial attention capacity; 3) most works only focus on the visual information, without considering the semantic relationship between different classes. To tackle the first problem, we propose a Long and Short-term Temporal Shift Module (LS-TSM). It extends the original TSM and expands the step size of shift operation to model long-term and short-term temporal information simultaneously. For the second problem, we expect to focus on the spatial area where the change of hand mainly occurs. Therefore, we propose a Spatial Attention Module (SAM) which utilizes the RGB difference between frames to get a spatial attention mask to assign different weights to different spatial positions. As for the last, we propose a Label Relation Module (LRM) which can take full advantage of the relationship among classes based on their labels’ semantic information. With the proposed modules, our work achieves the state-of-the-art performance on two commonly used gesture datasets, i.e., EgoGesture and NVGesture datasets. Extensive experiments demonstrate the effectiveness of our proposed modules.
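The two spatiotemporal ideas in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the `(T, C, H, W)` tensor layout, the channel split controlled by `fold_div`, the step sizes, and the max-normalization of the mask are all assumptions made for illustration. The abstract only states that LS-TSM enlarges the shift step to mix long-term and short-term information, and that SAM derives a spatial mask from RGB differences between frames.

```python
import numpy as np


def long_short_shift(x, short_step=1, long_step=2, fold_div=8):
    """Shift four channel groups along time; leave the rest untouched.

    x has shape (T, C, H, W). The group layout is an assumption:
    groups 0 and 1 use the short step, groups 2 and 3 the long step,
    each pair shifting forward and backward in time with zero padding.
    """
    assert fold_div >= 4, "need room for four shifted channel groups"
    t, c = x.shape[:2]
    fold = c // fold_div
    out = x.copy()
    plan = [(short_step, +1), (short_step, -1), (long_step, +1), (long_step, -1)]
    for i, (step, direction) in enumerate(plan):
        ch = slice(i * fold, (i + 1) * fold)
        s = min(step, t)
        out[:, ch] = 0  # positions with no source frame stay zero
        if direction > 0:
            # frame k receives the features of frame k - step
            out[s:, ch] = x[:t - s, ch]
        else:
            # frame k receives the features of frame k + step
            out[:t - s, ch] = x[s:, ch]
    return out


def rgb_diff_attention(frames):
    """Build a (T, 1, H, W) mask in [0, 1] from frame-to-frame RGB change.

    frames has shape (T, 3, H, W) with T >= 2. Normalizing by the
    per-frame maximum is an assumed choice, not taken from the paper.
    """
    diff = np.abs(np.diff(frames, axis=0)).sum(axis=1, keepdims=True)
    # The first frame has no predecessor, so it reuses the first difference.
    diff = np.concatenate([diff[:1], diff], axis=0)
    peak = diff.max(axis=(2, 3), keepdims=True)
    return diff / np.maximum(peak, 1e-8)
```

In this sketch, a feature map would be reweighted as `features * rgb_diff_attention(frames)` when the spatial sizes match, so that positions with larger inter-frame change (typically the moving hand) receive larger weights.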

Original language: English
Title of host publication: Pattern Recognition and Computer Vision - 5th Chinese Conference, PRCV 2022, Proceedings
Editors: Shiqi Yu, Jianguo Zhang, Zhaoxiang Zhang, Tieniu Tan, Pong C. Yuen, Yike Guo, Junwei Han, Jianhuang Lai
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 420-434
Number of pages: 15
ISBN (Print): 9783031189128
DOI
Publication status: Published - 2022
Event: 5th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2022 - Shenzhen, China
Duration: 4 Nov 2022 to 7 Nov 2022

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 13536 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 5th Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2022
Country/Territory: China
City: Shenzhen
Period: 4/11/22 to 7/11/22
