TY - JOUR
T1 - Optimized convolutional pose machine for 2D hand pose estimation
AU - Pan, Tianhong
AU - Wang, Zheng
AU - Fan, Yuan
N1 - Publisher Copyright:
© 2022 Elsevier Inc.
PY - 2022/2
Y1 - 2022/2
N2 - Hand pose estimation is a challenging task owing to the high flexibility and serious self-occlusion of the hand. Therefore, an optimized convolutional pose machine (OCPM) was proposed in this study to estimate the hand pose accurately. Traditional CPMs have two components, a feature extraction module and an information processing module. First, the backbone network of the feature extraction module was replaced by Resnet-18 to reduce the number of network parameters. Furthermore, an attention module called the convolutional block attention module (CBAM) is embedded into the feature extraction module to enhance the information extraction. Then, the structure of the information processing module was adjusted through a residual connection in each stage that consist of a series of continuous convolutional operations, and requires a dense fusion between the output from all previous stages and the feature extraction module. The experimental results on two public datasets showed that the OCPM network achieved excellent performance.
AB - Hand pose estimation is a challenging task owing to the high flexibility and serious self-occlusion of the hand. Therefore, an optimized convolutional pose machine (OCPM) was proposed in this study to estimate the hand pose accurately. Traditional CPMs have two components, a feature extraction module and an information processing module. First, the backbone network of the feature extraction module was replaced by Resnet-18 to reduce the number of network parameters. Furthermore, an attention module called the convolutional block attention module (CBAM) is embedded into the feature extraction module to enhance the information extraction. Then, the structure of the information processing module was adjusted through a residual connection in each stage that consist of a series of continuous convolutional operations, and requires a dense fusion between the output from all previous stages and the feature extraction module. The experimental results on two public datasets showed that the OCPM network achieved excellent performance.
KW - 2D hand pose estimation
KW - Convolutional block attention module (CBAM)
KW - Convolutional pose machine (CPM)
KW - Feature fusion
KW - Resnet-18
UR - http://www.scopus.com/inward/record.url?scp=85124689606&partnerID=8YFLogxK
U2 - 10.1016/j.jvcir.2022.103461
DO - 10.1016/j.jvcir.2022.103461
M3 - 文章
AN - SCOPUS:85124689606
SN - 1047-3203
VL - 83
JO - Journal of Visual Communication and Image Representation
JF - Journal of Visual Communication and Image Representation
M1 - 103461
ER -