U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

Wenbai Chen; Bo Zhang; Jingchen Li; Xin Zhao; Yiqun Wang; Jianping Gou; Haobin Shi

doi:10.1109/TCE.2024.3424671

U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

Wenbai Chen, Bo Zhang, Jingchen Li, Xin Zhao, Yiqun Wang, Jianping Gou, Haobin Shi

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

In the assembly of consumer electronic products, target detection methods offer details on the target’s location and category, but precise positioning with the robotic arm’s end-effector demands pixel-level edge contour data of the target. As a result, we’ve developed U2Net-MGP, a lightweight and efficient visual perception model. This model effectively captures edge contours for positioning consumer electronic components. Firstly, the residual U-blocks (RSU) in U2Net have been replaced with the ghost convolution residual U-blocks (GRSU) as designed in this paper. This change reduces the complexity of the model while improving accuracy. Furthermore, we’ve introduced polarized self-attention and created a polarized self-attention feature fusion module (PFF). This innovation enables the model to capture both local and global information effectively, enhances the modeling capacity of the feature data, and ultimately improves the accuracy of pixel regression. In this paper, we conducted ablation experiments and comparative experiments on the consumer electronics components dataset. The results reveal that the U2Net-MGP model is both compact and efficacious, markedly bolstering segmentation capability while reducing to 72.3% of the baseline model’s size. Relative to the original, it manifests increases of 3.2%, 5.3%, 3.6%, and 4.2% in the precision, recall, Fβ, and mean absolute error, respectively.

源语言	英语
页（从-至）	1
页数	1
期刊	IEEE Transactions on Consumer Electronics
DOI	https://doi.org/10.1109/TCE.2024.3424671
出版状态	已接受/待刊 - 2024

访问文件

10.1109/TCE.2024.3424671

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{009c174f0bde42439b473284989327a6,

title = "U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories",

abstract = "In the assembly of consumer electronic products, target detection methods offer details on the target{\textquoteright}s location and category, but precise positioning with the robotic arm{\textquoteright}s end-effector demands pixel-level edge contour data of the target. As a result, we{\textquoteright}ve developed U2Net-MGP, a lightweight and efficient visual perception model. This model effectively captures edge contours for positioning consumer electronic components. Firstly, the residual U-blocks (RSU) in U2Net have been replaced with the ghost convolution residual U-blocks (GRSU) as designed in this paper. This change reduces the complexity of the model while improving accuracy. Furthermore, we{\textquoteright}ve introduced polarized self-attention and created a polarized self-attention feature fusion module (PFF). This innovation enables the model to capture both local and global information effectively, enhances the modeling capacity of the feature data, and ultimately improves the accuracy of pixel regression. In this paper, we conducted ablation experiments and comparative experiments on the consumer electronics components dataset. The results reveal that the U2Net-MGP model is both compact and efficacious, markedly bolstering segmentation capability while reducing to 72.3% of the baseline model{\textquoteright}s size. Relative to the original, it manifests increases of 3.2%, 5.3%, 3.6%, and 4.2% in the precision, recall, Fβ, and mean absolute error, respectively.",

keywords = "Accuracy, Assembly, Computational modeling, Consumer Electronics, Consumer electronics, Decoding, Feature extraction, Ghost convolution, Image segmentation, Multi-scale feature fusion, Polarized self-attention mechanism, Salient object segmentation",

author = "Wenbai Chen and Bo Zhang and Jingchen Li and Xin Zhao and Yiqun Wang and Jianping Gou and Haobin Shi",

note = "Publisher Copyright: IEEE",

year = "2024",

doi = "10.1109/TCE.2024.3424671",

language = "英语",

pages = "1",

journal = "IEEE Transactions on Consumer Electronics",

issn = "0098-3063",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - U2Net-MGP

T2 - A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

AU - Chen, Wenbai

AU - Zhang, Bo

AU - Li, Jingchen

AU - Zhao, Xin

AU - Wang, Yiqun

AU - Gou, Jianping

AU - Shi, Haobin

N1 - Publisher Copyright: IEEE

PY - 2024

Y1 - 2024

N2 - In the assembly of consumer electronic products, target detection methods offer details on the target’s location and category, but precise positioning with the robotic arm’s end-effector demands pixel-level edge contour data of the target. As a result, we’ve developed U2Net-MGP, a lightweight and efficient visual perception model. This model effectively captures edge contours for positioning consumer electronic components. Firstly, the residual U-blocks (RSU) in U2Net have been replaced with the ghost convolution residual U-blocks (GRSU) as designed in this paper. This change reduces the complexity of the model while improving accuracy. Furthermore, we’ve introduced polarized self-attention and created a polarized self-attention feature fusion module (PFF). This innovation enables the model to capture both local and global information effectively, enhances the modeling capacity of the feature data, and ultimately improves the accuracy of pixel regression. In this paper, we conducted ablation experiments and comparative experiments on the consumer electronics components dataset. The results reveal that the U2Net-MGP model is both compact and efficacious, markedly bolstering segmentation capability while reducing to 72.3% of the baseline model’s size. Relative to the original, it manifests increases of 3.2%, 5.3%, 3.6%, and 4.2% in the precision, recall, Fβ, and mean absolute error, respectively.

AB - In the assembly of consumer electronic products, target detection methods offer details on the target’s location and category, but precise positioning with the robotic arm’s end-effector demands pixel-level edge contour data of the target. As a result, we’ve developed U2Net-MGP, a lightweight and efficient visual perception model. This model effectively captures edge contours for positioning consumer electronic components. Firstly, the residual U-blocks (RSU) in U2Net have been replaced with the ghost convolution residual U-blocks (GRSU) as designed in this paper. This change reduces the complexity of the model while improving accuracy. Furthermore, we’ve introduced polarized self-attention and created a polarized self-attention feature fusion module (PFF). This innovation enables the model to capture both local and global information effectively, enhances the modeling capacity of the feature data, and ultimately improves the accuracy of pixel regression. In this paper, we conducted ablation experiments and comparative experiments on the consumer electronics components dataset. The results reveal that the U2Net-MGP model is both compact and efficacious, markedly bolstering segmentation capability while reducing to 72.3% of the baseline model’s size. Relative to the original, it manifests increases of 3.2%, 5.3%, 3.6%, and 4.2% in the precision, recall, Fβ, and mean absolute error, respectively.

KW - Accuracy

KW - Assembly

KW - Computational modeling

KW - Consumer Electronics

KW - Consumer electronics

KW - Decoding

KW - Feature extraction

KW - Ghost convolution

KW - Image segmentation

KW - Multi-scale feature fusion

KW - Polarized self-attention mechanism

KW - Salient object segmentation

UR - http://www.scopus.com/inward/record.url?scp=85198751363&partnerID=8YFLogxK

U2 - 10.1109/TCE.2024.3424671

DO - 10.1109/TCE.2024.3424671

M3 - 文章

AN - SCOPUS:85198751363

SN - 0098-3063

SP - 1

JO - IEEE Transactions on Consumer Electronics

JF - IEEE Transactions on Consumer Electronics

ER -

U2Net-MGP: A Lightweight and Efficient Visual Perception Algorithm for Consumer Electronic Accessories

摘要

访问文件

其它文件与链接

指纹

引用此