TY - JOUR
T1 - U2Net-MGP
T2 - A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories
AU - Chen, Wenbai
AU - Zhang, Bo
AU - Li, Jingchen
AU - Zhao, Xin
AU - Wang, Yiqun
AU - Gou, Jianping
AU - Shi, Haobin
N1 - Publisher Copyright:
IEEE
PY - 2024
Y1 - 2024
N2 - In the assembly of consumer electronic products, target detection methods offer details on the target’s location and category, but precise positioning with the robotic arm’s end-effector demands pixel-level edge contour data of the target. As a result, we’ve developed U2Net-MGP, a lightweight and efficient visual perception model. This model effectively captures edge contours for positioning consumer electronic components. Firstly, the residual U-blocks (RSU) in U2Net have been replaced with the ghost convolution residual U-blocks (GRSU) as designed in this paper. This change reduces the complexity of the model while improving accuracy. Furthermore, we’ve introduced polarized self-attention and created a polarized self-attention feature fusion module (PFF). This innovation enables the model to capture both local and global information effectively, enhances the modeling capacity of the feature data, and ultimately improves the accuracy of pixel regression. In this paper, we conducted ablation experiments and comparative experiments on the consumer electronics components dataset. The results reveal that the U2Net-MGP model is both compact and efficacious, markedly bolstering segmentation capability while reducing to 72.3% of the baseline model’s size. Relative to the original, it manifests increases of 3.2%, 5.3%, 3.6%, and 4.2% in the precision, recall, Fβ, and mean absolute error, respectively.
AB - In the assembly of consumer electronic products, target detection methods offer details on the target’s location and category, but precise positioning with the robotic arm’s end-effector demands pixel-level edge contour data of the target. As a result, we’ve developed U2Net-MGP, a lightweight and efficient visual perception model. This model effectively captures edge contours for positioning consumer electronic components. Firstly, the residual U-blocks (RSU) in U2Net have been replaced with the ghost convolution residual U-blocks (GRSU) as designed in this paper. This change reduces the complexity of the model while improving accuracy. Furthermore, we’ve introduced polarized self-attention and created a polarized self-attention feature fusion module (PFF). This innovation enables the model to capture both local and global information effectively, enhances the modeling capacity of the feature data, and ultimately improves the accuracy of pixel regression. In this paper, we conducted ablation experiments and comparative experiments on the consumer electronics components dataset. The results reveal that the U2Net-MGP model is both compact and efficacious, markedly bolstering segmentation capability while reducing to 72.3% of the baseline model’s size. Relative to the original, it manifests increases of 3.2%, 5.3%, 3.6%, and 4.2% in the precision, recall, Fβ, and mean absolute error, respectively.
KW - Accuracy
KW - Assembly
KW - Computational modeling
KW - Consumer Electronics
KW - Consumer electronics
KW - Decoding
KW - Feature extraction
KW - Ghost convolution
KW - Image segmentation
KW - Multi-scale feature fusion
KW - Polarized self-attention mechanism
KW - Salient object segmentation
UR - http://www.scopus.com/inward/record.url?scp=85198751363&partnerID=8YFLogxK
U2 - 10.1109/TCE.2024.3424671
DO - 10.1109/TCE.2024.3424671
M3 - 文章
AN - SCOPUS:85198751363
SN - 0098-3063
SP - 1
JO - IEEE Transactions on Consumer Electronics
JF - IEEE Transactions on Consumer Electronics
ER -