Adaptive path planning for wafer second probing via an attention-based hierarchical reinforcement learning framework with shared memory

Haobin Shi; Ziming He; Kao Shing Hwang

doi:10.1016/j.ins.2025.122089

Adaptive path planning for wafer second probing via an attention-based hierarchical reinforcement learning framework with shared memory

Haobin Shi, Ziming He, Kao Shing Hwang

计算机学院

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

In semiconductor manufacturing, wafer probing is a quality control process before packaging, usually performed by an automated machine with a fixed path. The unqualified grains in the first detection need to be confirmed again. The fixed path method is inefficient and requires manual intervention for the second wafer probing on randomly scattered grains. To this end, we propose a reinforcement learning-based adaptive path planning method for second wafer probing. To simplify decision-making in a large state space, we propose a novel attention-based hierarchical reinforcement learning method with shared memory (AHRL-SM) and introduce it into wafer probing for the first time. The high-level agent is responsible for focusing on the region with a large number of grains to be detected, while the low-level agent is responsible for planning the moving path of the probe in the specified sub-region. The soft attention mechanism and recurrent neural network are incorporated into the probing architecture to facilitate original image feature extraction and historical information acquisition, respectively. In addition, we propose a unique shared memory mechanism to further improve decision-making efficiency. The Markov decision process of the complete wafer second probing and the performance verification of the proposed method are thoroughly described in this work. Compared with the existing path planning methods for wafer probing, sufficient experimental results confirm that our method has obvious advantages in probing efficiency, grain surface protection, and generalization.

源语言	英语
文章编号	122089
期刊	Information Sciences
卷	710
DOI	https://doi.org/10.1016/j.ins.2025.122089
出版状态	已出版 - 8月 2025

访问文件

10.1016/j.ins.2025.122089

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{dea1eb29d5b0422c9a53dcf9a4617247,

title = "Adaptive path planning for wafer second probing via an attention-based hierarchical reinforcement learning framework with shared memory",

abstract = "In semiconductor manufacturing, wafer probing is a quality control process before packaging, usually performed by an automated machine with a fixed path. The unqualified grains in the first detection need to be confirmed again. The fixed path method is inefficient and requires manual intervention for the second wafer probing on randomly scattered grains. To this end, we propose a reinforcement learning-based adaptive path planning method for second wafer probing. To simplify decision-making in a large state space, we propose a novel attention-based hierarchical reinforcement learning method with shared memory (AHRL-SM) and introduce it into wafer probing for the first time. The high-level agent is responsible for focusing on the region with a large number of grains to be detected, while the low-level agent is responsible for planning the moving path of the probe in the specified sub-region. The soft attention mechanism and recurrent neural network are incorporated into the probing architecture to facilitate original image feature extraction and historical information acquisition, respectively. In addition, we propose a unique shared memory mechanism to further improve decision-making efficiency. The Markov decision process of the complete wafer second probing and the performance verification of the proposed method are thoroughly described in this work. Compared with the existing path planning methods for wafer probing, sufficient experimental results confirm that our method has obvious advantages in probing efficiency, grain surface protection, and generalization.",

keywords = "Attention mechanism, Hierarchical reinforcement learning, Path planning, Wafer probing",

author = "Haobin Shi and Ziming He and Hwang, {Kao Shing}",

note = "Publisher Copyright: {\textcopyright} 2025 Elsevier Inc.",

year = "2025",

month = aug,

doi = "10.1016/j.ins.2025.122089",

language = "英语",

volume = "710",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier Inc.",

}

TY - JOUR

T1 - Adaptive path planning for wafer second probing via an attention-based hierarchical reinforcement learning framework with shared memory

AU - Shi, Haobin

AU - He, Ziming

AU - Hwang, Kao Shing

PY - 2025/8

Y1 - 2025/8

N2 - In semiconductor manufacturing, wafer probing is a quality control process before packaging, usually performed by an automated machine with a fixed path. The unqualified grains in the first detection need to be confirmed again. The fixed path method is inefficient and requires manual intervention for the second wafer probing on randomly scattered grains. To this end, we propose a reinforcement learning-based adaptive path planning method for second wafer probing. To simplify decision-making in a large state space, we propose a novel attention-based hierarchical reinforcement learning method with shared memory (AHRL-SM) and introduce it into wafer probing for the first time. The high-level agent is responsible for focusing on the region with a large number of grains to be detected, while the low-level agent is responsible for planning the moving path of the probe in the specified sub-region. The soft attention mechanism and recurrent neural network are incorporated into the probing architecture to facilitate original image feature extraction and historical information acquisition, respectively. In addition, we propose a unique shared memory mechanism to further improve decision-making efficiency. The Markov decision process of the complete wafer second probing and the performance verification of the proposed method are thoroughly described in this work. Compared with the existing path planning methods for wafer probing, sufficient experimental results confirm that our method has obvious advantages in probing efficiency, grain surface protection, and generalization.

AB - In semiconductor manufacturing, wafer probing is a quality control process before packaging, usually performed by an automated machine with a fixed path. The unqualified grains in the first detection need to be confirmed again. The fixed path method is inefficient and requires manual intervention for the second wafer probing on randomly scattered grains. To this end, we propose a reinforcement learning-based adaptive path planning method for second wafer probing. To simplify decision-making in a large state space, we propose a novel attention-based hierarchical reinforcement learning method with shared memory (AHRL-SM) and introduce it into wafer probing for the first time. The high-level agent is responsible for focusing on the region with a large number of grains to be detected, while the low-level agent is responsible for planning the moving path of the probe in the specified sub-region. The soft attention mechanism and recurrent neural network are incorporated into the probing architecture to facilitate original image feature extraction and historical information acquisition, respectively. In addition, we propose a unique shared memory mechanism to further improve decision-making efficiency. The Markov decision process of the complete wafer second probing and the performance verification of the proposed method are thoroughly described in this work. Compared with the existing path planning methods for wafer probing, sufficient experimental results confirm that our method has obvious advantages in probing efficiency, grain surface protection, and generalization.

KW - Attention mechanism

KW - Hierarchical reinforcement learning

KW - Path planning

KW - Wafer probing

UR - http://www.scopus.com/inward/record.url?scp=105000322987&partnerID=8YFLogxK

U2 - 10.1016/j.ins.2025.122089

DO - 10.1016/j.ins.2025.122089

M3 - 文章

AN - SCOPUS:105000322987

SN - 0020-0255

VL - 710

JO - Information Sciences

JF - Information Sciences

M1 - 122089

ER -

Adaptive path planning for wafer second probing via an attention-based hierarchical reinforcement learning framework with shared memory

摘要

访问文件

其它文件与链接

指纹

引用此