Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning

Yang Yang, Haifei Chen, Xing Liu, Panfeng Huang

科研成果: 期刊稿件文章同行评审

1 引用 (Scopus)

摘要

To achieve psychological inclusion and skill development orientation in human skill training, this paper proposes a haptic-guided training strategy generation method with Deep Reinforcement Learning (DRL)-based agent as the core and Zone of Proximal Development (ZPD) tuning as the auxiliary. The information of the expert and trainee is stored first with a designed database that can be accessed in real-time, which establishes the data foundation. Then, under the DRL framework, a strategy generation agent is designed, which consists of an actor-network and two Q-networks. The former network generates the agent’s decision policy, while the other two Q-networks work to approximate the state-action value function, and the parameters of all of them are administrated by the Soft Actor-Critic (SAC) algorithm. In addition, for the first time, the psychological ZPD evaluation method is integrated into the strategy generation of the DRL-based agent, which is utilized to describe the relationship between a trainees intrinsic skills and guidance. With it, the problem of transitional guidance or insufficient guidance can be handled well. Finally, simulation experiments validate the proposed method, demonstrating its efficiency in regulating the trainee under favorable training conditions.

源语言英语
文章编号116
期刊Journal of Intelligent and Robotic Systems: Theory and Applications
110
3
DOI
出版状态已出版 - 9月 2024

指纹

探究 'Guidance-As-Progressive in Human Skill Training Based on Deep Reinforcement Learning' 的科研主题。它们共同构成独一无二的指纹。

引用此