Skeleton boxes: Solving skeleton based action detection with a single deep convolutional neural network

Bo Li, Yuchao Dai, Xuelian Cheng, Huahui Chen, Yi Lin, Mingyi He

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

27 Scopus citations

Abstract

Action recognition from well-segmented 3D skeleton video has been intensively studied. However, due to the difficulty in representing the 3D skeleton video and the lack of training data, action detection from streaming 3D skeleton video still lags far behind its recognition counterpart and image-based object detection. In this paper, we propose a novel approach for this problem, which leverages both effective skeleton video encoding and deep regression based object detection from images. Our framework consists of two parts: skeleton-based video image mapping, which encodes a skeleton video to a color image in a temporal preserving way, and an end-to-end trainable fast skeleton action detector (Skeleton Boxes) based on image detection. Experimental results on the latest and largest PKU-MMD benchmark dataset demonstrate that our method outperforms the state-of-the-art methods with a large margin. We believe our idea would inspire and benefit future research in this important area.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages613-616
Number of pages4
ISBN (Electronic)9781538605608
DOIs
StatePublished - 5 Sep 2017
Externally publishedYes
Event2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017 - Hong Kong, Hong Kong
Duration: 10 Jul 201714 Jul 2017

Publication series

Name2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017

Conference

Conference2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017
Country/TerritoryHong Kong
CityHong Kong
Period10/07/1714/07/17

Keywords

  • CNN
  • detection
  • end-to-end
  • skeleton

Fingerprint

Dive into the research topics of 'Skeleton boxes: Solving skeleton based action detection with a single deep convolutional neural network'. Together they form a unique fingerprint.

Cite this