Micro-gesture Online Recognition with Dual-stream Multi-scale Transformer in Long Videos

Yuhan Wang, Ke Rui Linghu, Hexiang Huang, Zhaoqiang Xia

科研成果: 期刊稿件会议文章同行评审

摘要

Micro-gestures are increasingly recognized as a key indicator in the field of emotion analysis and have garnered growing interest within the field. The majority of research efforts have been directed towards the classification of micro-gestures, which entails predicting their categories. However, comparatively fewer studies have been dedicated to the detection of micro-gestures. Micro-gesture online recognition (spotting), which involves predicting both the temporal position and the category, is a preliminary step for classification but has received limited attention. In this context, we construct a deep network with dual-stream input for micro-gesture online recognition. Specifically, we utilize a sequential action recognition model to extract motion features from RGB and skeleton sequences separately, which are then processed by the multi-scale Transformer encoder as detection model. The proposed network are trained in a two-stage strategy and combined to perform the temporal spotting. Our proposed method is validated on the SMG dataset and has achieved the first ranking in the task of online recognition from the MiGA2024 Challenge Track 2.

源语言英语
期刊CEUR Workshop Proceedings
3848
出版状态已出版 - 2024
活动2024 IJCAI Workshop and Challenge on Micro-Gesture Analysis for Hidden Emotion Understanding, MiGA 2024 - Jeju, 韩国
期限: 4 8月 2024 → …

指纹

探究 'Micro-gesture Online Recognition with Dual-stream Multi-scale Transformer in Long Videos' 的科研主题。它们共同构成独一无二的指纹。

引用此