TY - JOUR
T1 - Mix-attention approximation for homogeneous large-scale multi-agent reinforcement learning
AU - Yang, Shike
AU - Li, Jingchen
AU - Shi, Haobin
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature.
PY - 2023/2
Y1 - 2023/2
AB - In large-scale multi-agent environments with homogeneous agents, most existing works provide approximation methods to simplify the interactions among agents. In this work, we propose a new approximation, termed mix-attention approximation, to enhance multi-agent reinforcement learning. The approximation is realized by a mix-attention module that forms consistent consensuses among agents in partially observable environments. We leverage hard attention to compress each agent's perception to a small set of partial regions. These partial regions can engage the attention of several agents at the same time, and the correlations among them are generated by a soft-attention module. We present the training method for the mix-attention mechanism and discuss the consistency between the mix-attention module and the policy network. We then analyze the feasibility of this mix-attention-based approximation and show how our method can be integrated into other approximation methods. The proposal can be embedded into most reinforcement learning methods for large-scale multi-agent environments, and extensive experiments on multi-agent scenarios demonstrate the effectiveness of the proposed approach.
KW - Attention mechanism
KW - Homogeneous multi-agent system
KW - Large-scale multi-agent system
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85139502141&partnerID=8YFLogxK
DO - 10.1007/s00521-022-07880-4
M3 - Article
AN - SCOPUS:85139502141
SN - 0941-0643
VL - 35
SP - 3143
EP - 3154
JO - Neural Computing and Applications
JF - Neural Computing and Applications
IS - 4
ER -