Abstract
Many works provide intrinsic rewards to deal with sparse rewards in reinforcement learning. Due to the non-stationarity of multi-agent systems, however, existing methods cannot be applied directly to multi-agent reinforcement learning. In this paper, a fuzzy curiosity-driven mechanism is proposed for multi-agent reinforcement learning, by which agents can explore more efficiently in scenarios with sparse extrinsic rewards. First, we improve the variational auto-encoder to predict the next state from the agents' joint state and joint action. Then, several fuzzy partitions are built over the next joint state in order to assign the prediction error to the individual agents. With the proposed method, each agent in the multi-agent environment receives its own intrinsic reward. We elaborate on the proposed method in partially observable and fully observable environments separately. Experimental results show that with the proposed fuzzy curiosity-driven mechanism agents learn joint policies more efficiently, and that it also helps agents find better policies during training.
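The sketch below illustrates the general idea described in the abstract, not the authors' implementation: a plain MLP forward model stands in for the paper's improved variational auto-encoder, and Gaussian fuzzy memberships over the next joint state split the prediction error into per-agent intrinsic rewards. All names, shapes, and the membership design are illustrative assumptions.

```python
# Minimal sketch (assumptions throughout): an MLP forward model replaces the
# paper's improved VAE, and Gaussian fuzzy memberships assign the prediction
# error to agents as individual intrinsic rewards.
import torch
import torch.nn as nn

class ForwardModel(nn.Module):
    """Predicts the next joint state from the joint state and joint action."""
    def __init__(self, state_dim, action_dim, hidden=128):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, state_dim),
        )

    def forward(self, joint_state, joint_action):
        return self.net(torch.cat([joint_state, joint_action], dim=-1))

def fuzzy_intrinsic_rewards(pred_next, true_next, centers, sigma=1.0):
    """Split the squared prediction error among agents via fuzzy memberships.

    centers: (n_agents, state_dim) tensor, one fuzzy partition centre per agent
             (a hypothetical choice; the paper builds its partitions from the
             next joint state).
    Returns a (batch, n_agents) tensor of per-agent intrinsic rewards.
    """
    error = (pred_next - true_next).pow(2).sum(dim=-1, keepdim=True)        # (batch, 1)
    # Membership of the next joint state in each agent's fuzzy partition.
    dist = (true_next.unsqueeze(1) - centers.unsqueeze(0)).pow(2).sum(-1)   # (batch, n_agents)
    membership = torch.exp(-dist / (2 * sigma ** 2))
    membership = membership / membership.sum(dim=-1, keepdim=True)          # normalise to 1
    return membership * error                                               # (batch, n_agents)

# Usage sketch: 3 agents, 6-dim joint state, 3-dim joint action.
model = ForwardModel(state_dim=6, action_dim=3)
s, a, s_next = torch.randn(4, 6), torch.randn(4, 3), torch.randn(4, 6)
centers = torch.randn(3, 6)
r_int = fuzzy_intrinsic_rewards(model(s, a), s_next, centers)  # per-agent intrinsic rewards
```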
| Original language | English |
| --- | --- |
| Pages (from-to) | 1222-1233 |
| Number of pages | 12 |
| Journal | International Journal of Fuzzy Systems |
| Volume | 23 |
| Issue number | 5 |
| DOIs | |
| State | Published - Jul 2021 |
| Externally published | Yes |
Keywords
- Curiosity-driven
- Multi-agent system
- Reinforcement learning