Multi-agent reinforcement learning by the actor-critic model with an attention interface

Lixiang Zhang; Jingchen Li; Yi'an Zhu; Haobin Shi; Kao Shing Hwang

doi:10.1016/j.neucom.2021.06.049

Multi-agent reinforcement learning by the actor-critic model with an attention interface

Lixiang Zhang, Jingchen Li, Yi'an Zhu, Haobin Shi, Kao Shing Hwang

School of Computer Science

Research output: Contribution to journal › Article › peer-review

20 Scopus citations

Abstract

Multi-agent reinforcement learning algorithms have achieved satisfactory performances in various scenarios, but many of them encounter difficulties in partially observable environments. In partially observable environments, the inability to perceive environment states results in unsteadiness and misconvergence, especially in large-scale multi-agent environments. To improve interactions among homogeneous agents in a partially observable environment, we propose a novel multi-agent actor-critic model with a visual attention interface to solve this problem. First, a recurrent visual attention interface is used to extract a latent state from each agent's partial observation. These latent states allow agents to focus on several local environments, in which each agent has a complete perception of a local environment and the intricate multi-agent environment is teased out by the interaction among several agents in the same local environment. The proposed method trains multi-agent systems with a centralized training and decentralized execution mechanism. The joint action of agents is approximated by the mean-field theory because the number of agents in a local environment is uncertain. Experimental results on the simulation platform suggest that our model performs better when training large-scale multi-agent systems in partially observable environments than baselines.

Original language	English
Pages (from-to)	275-284
Number of pages	10
Journal	Neurocomputing
Volume	471
DOIs	https://doi.org/10.1016/j.neucom.2021.06.049
State	Published - 30 Jan 2022

Keywords

Actor-critic
Attention mechanism
Mean-field theory
Multi-agent reinforcement learning
Multi-agent system

Access to Document

10.1016/j.neucom.2021.06.049

Cite this

@article{d50cef9990bb44dbaadc71926ed4deb8,

title = "Multi-agent reinforcement learning by the actor-critic model with an attention interface",

abstract = "Multi-agent reinforcement learning algorithms have achieved satisfactory performances in various scenarios, but many of them encounter difficulties in partially observable environments. In partially observable environments, the inability to perceive environment states results in unsteadiness and misconvergence, especially in large-scale multi-agent environments. To improve interactions among homogeneous agents in a partially observable environment, we propose a novel multi-agent actor-critic model with a visual attention interface to solve this problem. First, a recurrent visual attention interface is used to extract a latent state from each agent's partial observation. These latent states allow agents to focus on several local environments, in which each agent has a complete perception of a local environment and the intricate multi-agent environment is teased out by the interaction among several agents in the same local environment. The proposed method trains multi-agent systems with a centralized training and decentralized execution mechanism. The joint action of agents is approximated by the mean-field theory because the number of agents in a local environment is uncertain. Experimental results on the simulation platform suggest that our model performs better when training large-scale multi-agent systems in partially observable environments than baselines.",

keywords = "Actor-critic, Attention mechanism, Mean-field theory, Multi-agent reinforcement learning, Multi-agent system",

author = "Lixiang Zhang and Jingchen Li and Yi'an Zhu and Haobin Shi and Hwang, {Kao Shing}",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier B.V.",

year = "2022",

month = jan,

day = "30",

doi = "10.1016/j.neucom.2021.06.049",

language = "英语",

volume = "471",

pages = "275--284",

journal = "Neurocomputing",

issn = "0925-2312",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Multi-agent reinforcement learning by the actor-critic model with an attention interface

AU - Zhang, Lixiang

AU - Li, Jingchen

AU - Zhu, Yi'an

AU - Shi, Haobin

AU - Hwang, Kao Shing

PY - 2022/1/30

Y1 - 2022/1/30

N2 - Multi-agent reinforcement learning algorithms have achieved satisfactory performances in various scenarios, but many of them encounter difficulties in partially observable environments. In partially observable environments, the inability to perceive environment states results in unsteadiness and misconvergence, especially in large-scale multi-agent environments. To improve interactions among homogeneous agents in a partially observable environment, we propose a novel multi-agent actor-critic model with a visual attention interface to solve this problem. First, a recurrent visual attention interface is used to extract a latent state from each agent's partial observation. These latent states allow agents to focus on several local environments, in which each agent has a complete perception of a local environment and the intricate multi-agent environment is teased out by the interaction among several agents in the same local environment. The proposed method trains multi-agent systems with a centralized training and decentralized execution mechanism. The joint action of agents is approximated by the mean-field theory because the number of agents in a local environment is uncertain. Experimental results on the simulation platform suggest that our model performs better when training large-scale multi-agent systems in partially observable environments than baselines.

AB - Multi-agent reinforcement learning algorithms have achieved satisfactory performances in various scenarios, but many of them encounter difficulties in partially observable environments. In partially observable environments, the inability to perceive environment states results in unsteadiness and misconvergence, especially in large-scale multi-agent environments. To improve interactions among homogeneous agents in a partially observable environment, we propose a novel multi-agent actor-critic model with a visual attention interface to solve this problem. First, a recurrent visual attention interface is used to extract a latent state from each agent's partial observation. These latent states allow agents to focus on several local environments, in which each agent has a complete perception of a local environment and the intricate multi-agent environment is teased out by the interaction among several agents in the same local environment. The proposed method trains multi-agent systems with a centralized training and decentralized execution mechanism. The joint action of agents is approximated by the mean-field theory because the number of agents in a local environment is uncertain. Experimental results on the simulation platform suggest that our model performs better when training large-scale multi-agent systems in partially observable environments than baselines.

KW - Actor-critic

KW - Attention mechanism

KW - Mean-field theory

KW - Multi-agent reinforcement learning

KW - Multi-agent system

UR - http://www.scopus.com/inward/record.url?scp=85109028451&partnerID=8YFLogxK

U2 - 10.1016/j.neucom.2021.06.049

DO - 10.1016/j.neucom.2021.06.049

M3 - 文章

AN - SCOPUS:85109028451

SN - 0925-2312

VL - 471

SP - 275

EP - 284

JO - Neurocomputing

JF - Neurocomputing

ER -

Multi-agent reinforcement learning by the actor-critic model with an attention interface

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this