TY - JOUR
T1 - High-resolution network with an auxiliary channel for 2D hand pose estimation
AU - Pan, Tianhong
AU - Wang, Zheng
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.
PY - 2024/4
Y1 - 2024/4
N2 - High-resolution networks have been applied in various fields because of their advanced architecture. However, multiple multi-scale fusions of high and low-dimensional semantic information during hand pose estimation can blur the position information obtained in high resolution, causing overfitting. To address this problem, we added an auxiliary channel parallel to the original network in this study. The auxiliary channel slices images using a slicing operation instead of a convolutional downscaling operation to preserve the full information in the input. The input is then computed by following four convolution layers to obtain the initial position correction information, and the results are combined with the network for prediction. Adding the auxiliary channel increases the number of parameters in the original network by only 0.7%, but obtains a high accuracy gain, which is particularly noticeable on lightweight networks. We performed several experiments to verify the effectiveness of this method using multiple datasets.
AB - High-resolution networks have been applied in various fields because of their advanced architecture. However, multiple multi-scale fusions of high and low-dimensional semantic information during hand pose estimation can blur the position information obtained in high resolution, causing overfitting. To address this problem, we added an auxiliary channel parallel to the original network in this study. The auxiliary channel slices images using a slicing operation instead of a convolutional downscaling operation to preserve the full information in the input. The input is then computed by following four convolution layers to obtain the initial position correction information, and the results are combined with the network for prediction. Adding the auxiliary channel increases the number of parameters in the original network by only 0.7%, but obtains a high accuracy gain, which is particularly noticeable on lightweight networks. We performed several experiments to verify the effectiveness of this method using multiple datasets.
KW - auxiliary channel
KW - High-resolution network (HRnet)
KW - multi-scale integration
KW - slice operation
UR - http://www.scopus.com/inward/record.url?scp=85162704586&partnerID=8YFLogxK
U2 - 10.1007/s11042-023-16045-x
DO - 10.1007/s11042-023-16045-x
M3 - 文章
AN - SCOPUS:85162704586
SN - 1380-7501
VL - 83
SP - 36683
EP - 36694
JO - Multimedia Tools and Applications
JF - Multimedia Tools and Applications
IS - 12
ER -