Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning

Yang Yang; Haifei Chen; Xing Liu; Panfeng Huang

doi:10.1007/s10846-024-02147-7

Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning

Yang Yang, Haifei Chen, Xing Liu, Panfeng Huang

航天学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

1 引用（Scopus）

摘要

To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. The information of the expert and trainee is stored first with a designed database that can be accessed in real-time, which establishes the data foundation. Then, under the DRL framework, a strategy generation agent is designed, which consists of an actor-network and two Q-networks. The former network generates the agent’s decision policy, while the other two Q-networks work to approximate the state-action value function, and the parameters of all of them are administrated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent, which is utilized to describe the relationship between a trainees intrinsic skills and guidance. With it, the problem of transitional guidance or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its efficiency in regulating the trainee under favorable training conditions.

源语言	英语
文章编号	116
期刊	Journal of Intelligent and Robotic Systems: Theory and Applications
卷	110
期	3
DOI	https://doi.org/10.1007/s10846-024-02147-7
出版状态	已出版 - 9月 2024

访问文件

10.1007/s10846-024-02147-7

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{8e4a9f088b7b4effb1415d853a0f6b29,

title = "Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning",

abstract = "To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. The information of the expert and trainee is stored first with a designed database that can be accessed in real-time, which establishes the data foundation. Then, under the DRL framework, a strategy generation agent is designed, which consists of an actor-network and two Q-networks. The former network generates the agent{\textquoteright}s decision policy, while the other two Q-networks work to approximate the state-action value function, and the parameters of all of them are administrated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent, which is utilized to describe the relationship between a trainees intrinsic skills and guidance. With it, the problem of transitional guidance or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its efficiency in regulating the trainee under favorable training conditions.",

keywords = "Deep reinforcement learning (DRL), Haptic guided human skill training, Skill development orientation, Zone proximal development (ZPD)",

author = "Yang Yang and Haifei Chen and Xing Liu and Panfeng Huang",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",

year = "2024",

month = sep,

doi = "10.1007/s10846-024-02147-7",

language = "英语",

volume = "110",

journal = "Journal of Intelligent and Robotic Systems: Theory and Applications",

issn = "0921-0296",

publisher = "Springer Nature",

number = "3",

}

TY - JOUR

T1 - Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning

AU - Yang, Yang

AU - Chen, Haifei

AU - Liu, Xing

AU - Huang, Panfeng

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024/9

Y1 - 2024/9

N2 - To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. The information of the expert and trainee is stored first with a designed database that can be accessed in real-time, which establishes the data foundation. Then, under the DRL framework, a strategy generation agent is designed, which consists of an actor-network and two Q-networks. The former network generates the agent’s decision policy, while the other two Q-networks work to approximate the state-action value function, and the parameters of all of them are administrated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent, which is utilized to describe the relationship between a trainees intrinsic skills and guidance. With it, the problem of transitional guidance or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its efficiency in regulating the trainee under favorable training conditions.

AB - To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. The information of the expert and trainee is stored first with a designed database that can be accessed in real-time, which establishes the data foundation. Then, under the DRL framework, a strategy generation agent is designed, which consists of an actor-network and two Q-networks. The former network generates the agent’s decision policy, while the other two Q-networks work to approximate the state-action value function, and the parameters of all of them are administrated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent, which is utilized to describe the relationship between a trainees intrinsic skills and guidance. With it, the problem of transitional guidance or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its efficiency in regulating the trainee under favorable training conditions.

KW - Deep reinforcement learning (DRL)

KW - Haptic guided human skill training

KW - Skill development orientation

KW - Zone proximal development (ZPD)

UR - http://www.scopus.com/inward/record.url?scp=85200319339&partnerID=8YFLogxK

U2 - 10.1007/s10846-024-02147-7

DO - 10.1007/s10846-024-02147-7

M3 - 文章

AN - SCOPUS:85200319339

SN - 0921-0296

VL - 110

JO - Journal of Intelligent and Robotic Systems: Theory and Applications

JF - Journal of Intelligent and Robotic Systems: Theory and Applications

IS - 3

M1 - 116

ER -

Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning

摘要

访问文件

其它文件与链接

指纹

引用此