A collaboration of multi-agent model using an interactive interface

Jingchen Li, Fan Wu, Haobin Shi, Kao Shing Hwang

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

Multi-agent reinforcement learning algorithms rarely address noisy environments, in which noise prevents agents from learning optimal policies and making correct decisions. This work investigates the effect of noise in multi-agent environments and proposes a multi-agent actor-critic with collaboration (MACC) model, which uses lightweight communication to overcome interference from noise. Each agent in MACC has two policies: a collaboration policy and a behavior policy. An agent's behavior depends not only on its own state but is also influenced by every other agent through a scalar collaboration value. The collaboration value is generated by each agent's collaboration policy and ensures a succinct consensus about the environment. This paper elaborates on the training of the collaboration policy and specifies how it coordinates the behavior policy through a temporal abstraction mechanism, while the observation sequence is considered for more accurate perception. Experiments on multi-agent collaboration simulation platforms demonstrate that MACC outperforms baselines in noisy environments, especially partially observable ones.
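The two-policy structure described in the abstract can be illustrated with a minimal sketch: each agent's collaboration policy compresses its observation into a single scalar, and its behavior policy then acts on its own observation together with the other agents' scalars. All function and parameter names below are illustrative assumptions, not taken from the paper, and the linear/tanh parameterization stands in for whatever networks the authors actually train.

```python
import numpy as np

rng = np.random.default_rng(0)

OBS_DIM, N_AGENTS, N_ACTIONS = 4, 3, 2

# Hypothetical per-agent parameters (random here; trained in the real model).
collab_weights = [rng.normal(size=OBS_DIM) for _ in range(N_AGENTS)]
behavior_weights = [rng.normal(size=(N_ACTIONS, OBS_DIM + N_AGENTS - 1))
                    for _ in range(N_AGENTS)]

def collaboration_value(i, obs):
    """Collaboration policy: compress agent i's observation into one scalar."""
    return float(np.tanh(collab_weights[i] @ obs))

def behavior_policy(i, obs, collab_values):
    """Behavior policy: act on own observation plus the other agents' scalars."""
    others = [v for j, v in enumerate(collab_values) if j != i]
    x = np.concatenate([obs, others])
    logits = behavior_weights[i] @ x
    e = np.exp(logits - logits.max())
    return e / e.sum()  # action probability distribution

# One decision step: agents exchange scalars, then each acts.
observations = [rng.normal(size=OBS_DIM) for _ in range(N_AGENTS)]
collab_values = [collaboration_value(i, o) for i, o in enumerate(observations)]
action_probs = [behavior_policy(i, o, collab_values)
                for i, o in enumerate(observations)]
```

The point of the sketch is the communication cost: each agent broadcasts one scalar per step rather than its full observation, which is the "lightweight communication" the abstract claims protects decision-making against noise.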

Original language: English
Pages (from-to): 349-363
Number of pages: 15
Journal: Information Sciences
Volume: 611
State: Published - Sep 2022

Keywords

  • Actor-critic
  • Multi-agent reinforcement learning
  • Partially observable environment
