TY - JOUR
T1 - Episode-Fuzzy-COACH Method for Fast Robot Skill Learning
AU - Li, Bingqian
AU - Liu, Xing
AU - Liu, Zhengxiong
AU - Huang, Panfeng
N1 - Publisher Copyright:
© 1982-2012 IEEE.
PY - 2024/6/1
Y1 - 2024/6/1
N2 - To realize robot skill learning in the real world, reinforcement learning algorithms need to be applied in continuous problems with high sample efficiency. Hybrid intelligence is regarded as an available solution for this problem, due to the ability to speed up the learning process with human knowledge and experience. Therefore, we propose Episode-Fuzzy-COACH (COrrective Advice Communicated by Humans), to imitate human fuzzy logic and involve human intelligence in the learning process. In this framework, human knowledge and experience are involved in the learning process, which are provided by human feedback and fuzzy rules designed by human users. Moreover, it is combined with Path Integrals Policy Improvement (PI2), to realize hybrid intelligence, which is used to realize fast robot skill learning. Throwing Movement Primitives proposed in this article is used to represent the policy of ball-throwing skill. According to the simulation results, the learning efficiency of our method is increased by 72% and 42.86%, respectively, compared with pure PI2 and PI2+COACH. Our method validated in experiments is 46.67% more effective than PI2+COACH. The results also show that the performance of our method is not affected by users' knowledge level of the related field. It is proven that PI2+Episode-Fuzzy-COACH is available for fast robot skill learning.
AB - To realize robot skill learning in the real world, reinforcement learning algorithms need to be applied in continuous problems with high sample efficiency. Hybrid intelligence is regarded as an available solution for this problem, due to the ability to speed up the learning process with human knowledge and experience. Therefore, we propose Episode-Fuzzy-COACH (COrrective Advice Communicated by Humans), to imitate human fuzzy logic and involve human intelligence in the learning process. In this framework, human knowledge and experience are involved in the learning process, which are provided by human feedback and fuzzy rules designed by human users. Moreover, it is combined with Path Integrals Policy Improvement (PI2), to realize hybrid intelligence, which is used to realize fast robot skill learning. Throwing Movement Primitives proposed in this article is used to represent the policy of ball-throwing skill. According to the simulation results, the learning efficiency of our method is increased by 72% and 42.86%, respectively, compared with pure PI2 and PI2+COACH. Our method validated in experiments is 46.67% more effective than PI2+COACH. The results also show that the performance of our method is not affected by users' knowledge level of the related field. It is proven that PI2+Episode-Fuzzy-COACH is available for fast robot skill learning.
KW - Hybrid intelligence
KW - interactive reinforcement learning
KW - robot skill learning
UR - http://www.scopus.com/inward/record.url?scp=85165411043&partnerID=8YFLogxK
U2 - 10.1109/TIE.2023.3294600
DO - 10.1109/TIE.2023.3294600
M3 - 文章
AN - SCOPUS:85165411043
SN - 0278-0046
VL - 71
SP - 5931
EP - 5940
JO - IEEE Transactions on Industrial Electronics
JF - IEEE Transactions on Industrial Electronics
IS - 6
ER -