Episode-Fuzzy-COACH Method for Fast Robot Skill Learning

Bingqian Li; Xing Liu; Zhengxiong Liu; Panfeng Huang

doi:10.1109/TIE.2023.3294600

School of Astronautics

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

To realize robot skill learning in the real world, reinforcement learning algorithms need to be applied in continuous problems with high sample efficiency. Hybrid intelligence is regarded as an available solution for this problem, due to the ability to speed up the learning process with human knowledge and experience. Therefore, we propose Episode-Fuzzy-COACH (COrrective Advice Communicated by Humans), to imitate human fuzzy logic and involve human intelligence in the learning process. In this framework, human knowledge and experience are involved in the learning process, which are provided by human feedback and fuzzy rules designed by human users. Moreover, it is combined with Path Integrals Policy Improvement (PI2), to realize hybrid intelligence, which is used to realize fast robot skill learning. Throwing Movement Primitives proposed in this article is used to represent the policy of ball-throwing skill. According to the simulation results, the learning efficiency of our method is increased by 72% and 42.86%, respectively, compared with pure PI2 and PI2+COACH. Our method validated in experiments is 46.67% more effective than PI2+COACH. The results also show that the performance of our method is not affected by users' knowledge level of the related field. It is proven that PI2+Episode-Fuzzy-COACH is available for fast robot skill learning.

Original language	English
Pages (from-to)	5931-5940
Number of pages	10
Journal	IEEE Transactions on Industrial Electronics
Volume	71
Issue number	6
DOIs	https://doi.org/10.1109/TIE.2023.3294600
State	Published - 1 Jun 2024

Keywords

Hybrid intelligence
interactive reinforcement learning
robot skill learning

Access to Document

10.1109/TIE.2023.3294600

Cite this

@article{91266383534b4c97bceb63e2753e8287,

title = "Episode-Fuzzy-COACH Method for Fast Robot Skill Learning",

abstract = "To realize robot skill learning in the real world, reinforcement learning algorithms need to be applied in continuous problems with high sample efficiency. Hybrid intelligence is regarded as an available solution for this problem, due to the ability to speed up the learning process with human knowledge and experience. Therefore, we propose Episode-Fuzzy-COACH (COrrective Advice Communicated by Humans), to imitate human fuzzy logic and involve human intelligence in the learning process. In this framework, human knowledge and experience are involved in the learning process, which are provided by human feedback and fuzzy rules designed by human users. Moreover, it is combined with Path Integrals Policy Improvement (PI2), to realize hybrid intelligence, which is used to realize fast robot skill learning. Throwing Movement Primitives proposed in this article is used to represent the policy of ball-throwing skill. According to the simulation results, the learning efficiency of our method is increased by 72% and 42.86%, respectively, compared with pure PI2 and PI2+COACH. Our method validated in experiments is 46.67% more effective than PI2+COACH. The results also show that the performance of our method is not affected by users' knowledge level of the related field. It is proven that PI2+Episode-Fuzzy-COACH is available for fast robot skill learning.",

keywords = "Hybrid intelligence, interactive reinforcement learning, robot skill learning",

author = "Bingqian Li and Xing Liu and Zhengxiong Liu and Panfeng Huang",

note = "Publisher Copyright: {\textcopyright} 1982-2012 IEEE.",

year = "2024",

month = jun,

day = "1",

doi = "10.1109/TIE.2023.3294600",

language = "English",

volume = "71",

pages = "5931--5940",

journal = "IEEE Transactions on Industrial Electronics",

issn = "0278-0046",

publisher = "IEEE Industrial Electronics Society",

number = "6",

}

TY - JOUR

T1 - Episode-Fuzzy-COACH Method for Fast Robot Skill Learning

AU - Li, Bingqian

AU - Liu, Xing

AU - Liu, Zhengxiong

AU - Huang, Panfeng

PY - 2024/6/1

Y1 - 2024/6/1

N2 - To realize robot skill learning in the real world, reinforcement learning algorithms need to be applied in continuous problems with high sample efficiency. Hybrid intelligence is regarded as an available solution for this problem, due to the ability to speed up the learning process with human knowledge and experience. Therefore, we propose Episode-Fuzzy-COACH (COrrective Advice Communicated by Humans), to imitate human fuzzy logic and involve human intelligence in the learning process. In this framework, human knowledge and experience are involved in the learning process, which are provided by human feedback and fuzzy rules designed by human users. Moreover, it is combined with Path Integrals Policy Improvement (PI2), to realize hybrid intelligence, which is used to realize fast robot skill learning. Throwing Movement Primitives proposed in this article is used to represent the policy of ball-throwing skill. According to the simulation results, the learning efficiency of our method is increased by 72% and 42.86%, respectively, compared with pure PI2 and PI2+COACH. Our method validated in experiments is 46.67% more effective than PI2+COACH. The results also show that the performance of our method is not affected by users' knowledge level of the related field. It is proven that PI2+Episode-Fuzzy-COACH is available for fast robot skill learning.

AB - To realize robot skill learning in the real world, reinforcement learning algorithms need to be applied in continuous problems with high sample efficiency. Hybrid intelligence is regarded as an available solution for this problem, due to the ability to speed up the learning process with human knowledge and experience. Therefore, we propose Episode-Fuzzy-COACH (COrrective Advice Communicated by Humans), to imitate human fuzzy logic and involve human intelligence in the learning process. In this framework, human knowledge and experience are involved in the learning process, which are provided by human feedback and fuzzy rules designed by human users. Moreover, it is combined with Path Integrals Policy Improvement (PI2), to realize hybrid intelligence, which is used to realize fast robot skill learning. Throwing Movement Primitives proposed in this article is used to represent the policy of ball-throwing skill. According to the simulation results, the learning efficiency of our method is increased by 72% and 42.86%, respectively, compared with pure PI2 and PI2+COACH. Our method validated in experiments is 46.67% more effective than PI2+COACH. The results also show that the performance of our method is not affected by users' knowledge level of the related field. It is proven that PI2+Episode-Fuzzy-COACH is available for fast robot skill learning.

KW - Hybrid intelligence

KW - interactive reinforcement learning

KW - robot skill learning

UR - http://www.scopus.com/inward/record.url?scp=85165411043&partnerID=8YFLogxK

U2 - 10.1109/TIE.2023.3294600

DO - 10.1109/TIE.2023.3294600

M3 - Article

AN - SCOPUS:85165411043

SN - 0278-0046

VL - 71

SP - 5931

EP - 5940

JO - IEEE Transactions on Industrial Electronics

JF - IEEE Transactions on Industrial Electronics

IS - 6

ER -
