Micro-gesture Online Recognition with Graph-convolution and Multiscale Transformers for Long Sequence

Xu Peng Guo, Wei Peng, Hexiang Huang, Zhaoqiang Xia

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

Micro-gesture is becoming a fundamental clue of emotion analysis and achieves more attention in this field. The studies are mainly focused on the task of micro-gesture classification which predicts the categories of micro-gesture while no works have been reported for spotting the micro-gestures. As a preliminary step for classification, the micro-gesture online recognition (spotting) that predicts the temporal location and category has achieved limited attention. In this context, we propose a novel deep network for micro-gesture online recognition, which incorporates the graph-convolution and multiscale transformer encoders. Specifically, we utilize a graph-convolution based Transformer module to extract motion features of 2D skeleton sequences, which are then processed by a feature pyramid module to obtain hierarchical multiscale features. We further employ a local Transformer module to model the similarity between micro-gesture frames, and decouple the classification and regression branches to achieve accurate location and category. These Transformers are trained in a two-stage strategy and combined to perform the spotting. Our proposed method is validated on the iMiGUE dataset and has achieved the first ranking in the task of online recognition (Track 2) of the MiGA2023 Challenge.

Original languageEnglish
JournalCEUR Workshop Proceedings
Volume3522
StatePublished - 2023
Event1st IJCAI Workshop and Challenge on Micro-Gesture Analysis for Hidden Emotion Understanding, MiGA 2023 - Macau, China
Duration: 21 Aug 202322 Aug 2023

Keywords

  • Graph convolution
  • Micro-gesture online recognition
  • Multiscale Transformer

Fingerprint

Dive into the research topics of 'Micro-gesture Online Recognition with Graph-convolution and Multiscale Transformers for Long Sequence'. Together they form a unique fingerprint.

Cite this