High-resolution network with an auxiliary channel for 2D hand pose estimation

Tianhong Pan, Zheng Wang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

High-resolution networks have been applied in various fields because of their advanced architecture. However, multiple multi-scale fusions of high and low-dimensional semantic information during hand pose estimation can blur the position information obtained in high resolution, causing overfitting. To address this problem, we added an auxiliary channel parallel to the original network in this study. The auxiliary channel slices images using a slicing operation instead of a convolutional downscaling operation to preserve the full information in the input. The input is then computed by following four convolution layers to obtain the initial position correction information, and the results are combined with the network for prediction. Adding the auxiliary channel increases the number of parameters in the original network by only 0.7%, but obtains a high accuracy gain, which is particularly noticeable on lightweight networks. We performed several experiments to verify the effectiveness of this method using multiple datasets.

Original languageEnglish
Pages (from-to)36683-36694
Number of pages12
JournalMultimedia Tools and Applications
Volume83
Issue number12
DOIs
StatePublished - Apr 2024
Externally publishedYes

Keywords

  • auxiliary channel
  • High-resolution network (HRnet)
  • multi-scale integration
  • slice operation

Fingerprint

Dive into the research topics of 'High-resolution network with an auxiliary channel for 2D hand pose estimation'. Together they form a unique fingerprint.

Cite this