Learning shape-motion representations from geometric algebra spatio-temporal model for skeleton-based action recognition

Yanshan Li, Rongjie Xia, Xing Liu, Qinghua Huang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

58 Scopus citations

Abstract

Skeleton-based action recognition has been widely applied in intelligent video surveillance and human behavior analysis. Previous works have successfully applied Convolutional Neural Networks (CNN) to learn spatio-temporal characteristics of the skeleton sequence. However, they merely focus on the coordinates of isolated joints, which ignore the spatial relationships between joints and only implicitly learn the motion representations. To solve these problems, we propose an effective method to learn comprehensive representations from skeleton sequences by using Geometric Algebra. Firstly, a frontal orientation based spatio-temporal model is constructed to represent the spatial configuration and temporal dynamics of skeleton sequences, which owns the robustness against view variations. Then the shape-motion representations which mutually compensate are learned to describe skeleton actions comprehensively. Finally, a multi-stream CNN model is applied to extract and fuse deep features from the complementary shape-motion representations. Experimental results on NTU RGB+D and Northwestern-UCLA datasets consistently verify the superiority of our method.

Original languageEnglish
Title of host publicationProceedings - 2019 IEEE International Conference on Multimedia and Expo, ICME 2019
PublisherIEEE Computer Society
Pages1066-1071
Number of pages6
ISBN (Electronic)9781538695524
DOIs
StatePublished - Jul 2019
Event2019 IEEE International Conference on Multimedia and Expo, ICME 2019 - Shanghai, China
Duration: 8 Jul 201912 Jul 2019

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume2019-July
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2019 IEEE International Conference on Multimedia and Expo, ICME 2019
Country/TerritoryChina
CityShanghai
Period8/07/1912/07/19

Keywords

  • Geometric algebra
  • Human action recognition
  • Skeleton sequence
  • Spatio-temporal model

Fingerprint

Dive into the research topics of 'Learning shape-motion representations from geometric algebra spatio-temporal model for skeleton-based action recognition'. Together they form a unique fingerprint.

Cite this