TY - JOUR
T1 - Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning
AU - Yang, Yang
AU - Chen, Haifei
AU - Liu, Xing
AU - Huang, Panfeng
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/9
Y1 - 2024/9
N2 - To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with a Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. Expert and trainee information is first stored in a purpose-designed database that can be accessed in real time, establishing the data foundation. Then, under the DRL framework, a strategy generation agent is designed, consisting of an actor network and two Q-networks. The actor network generates the agent’s decision policy, the two Q-networks approximate the state-action value function, and the parameters of all three are updated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent and is used to describe the relationship between a trainee’s intrinsic skills and the provided guidance. With it, the problems of excessive or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its effectiveness in keeping the trainee under favorable training conditions.
AB - To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with a Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. Expert and trainee information is first stored in a purpose-designed database that can be accessed in real time, establishing the data foundation. Then, under the DRL framework, a strategy generation agent is designed, consisting of an actor network and two Q-networks. The actor network generates the agent’s decision policy, the two Q-networks approximate the state-action value function, and the parameters of all three are updated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent and is used to describe the relationship between a trainee’s intrinsic skills and the provided guidance. With it, the problems of excessive or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its effectiveness in keeping the trainee under favorable training conditions.
KW - Deep reinforcement learning (DRL)
KW - Haptic-guided human skill training
KW - Skill development orientation
KW - Zone of Proximal Development (ZPD)
UR - http://www.scopus.com/inward/record.url?scp=85200319339&partnerID=8YFLogxK
U2 - 10.1007/s10846-024-02147-7
DO - 10.1007/s10846-024-02147-7
M3 - Article
AN - SCOPUS:85200319339
SN - 0921-0296
VL - 110
JO - Journal of Intelligent and Robotic Systems: Theory and Applications
JF - Journal of Intelligent and Robotic Systems: Theory and Applications
IS - 3
M1 - 116
ER -