Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System

Qingshuang Sun; Yuan Yao; Peng Yi; Xingshe Zhou; Gang Yang

doi:10.1007/978-981-19-4549-6_11

Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System

Qingshuang Sun, Yuan Yao, Peng Yi, Xingshe Zhou, Gang Yang

School of Computer Science

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

Centralized training and decentralized execution have become a basic setting for multi-agent reinforcement learning. As the number of agents increases, the performance of the actors that only use their own local observations with centralized critics is prone to bottlenecks in complex scenarios. Recent research has shown that agents learn when to communicate to share information efficiently, that agents communicate with each other in a right time during the execution phase to complete the cooperation task. Therefore, in this paper, we proposed a model that learn when to communicate under the centralized critic supporting, so that the agent is able to adaptive control communication under the centralized critic learned by global environmental information. Experiments in a cooperation scenario demonstrate the advantages of model. With our proposed cooperation model, agents are able to block communication at an appropriate time under the centralized critic setting and cooperation with each other at the task.

Original language	English
Title of host publication	Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers
Editors	Yuqing Sun, Tun Lu, Buqing Cao, Hongfei Fan, Dongning Liu, Bowen Du, Liping Gao
Publisher	Springer Science and Business Media Deutschland GmbH
Pages	134-146
Number of pages	13
ISBN (Print)	9789811945489
DOIs	https://doi.org/10.1007/978-981-19-4549-6_11
State	Published - 2022
Event	16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021 - Virtual, Online Duration: 26 Nov 2021 → 28 Nov 2021

Publication series

Name	Communications in Computer and Information Science
Volume	1492 CCIS
ISSN (Print)	1865-0929
ISSN (Electronic)	1865-0937

Conference

Conference	16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021
City	Virtual, Online
Period	26/11/21 → 28/11/21

Keywords

Centralized critic
Communication
Cooperation
Multi-agent
Reinforcement learning

Access to Document

10.1007/978-981-19-4549-6_11

Cite this

Sun, Q., Yao, Y., Yi, P., Zhou, X., & Yang, G. (2022). Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System. In Y. Sun, T. Lu, B. Cao, H. Fan, D. Liu, B. Du, & L. Gao (Eds.), Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers (pp. 134-146). (Communications in Computer and Information Science; Vol. 1492 CCIS). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-981-19-4549-6_11

Sun, Qingshuang ; Yao, Yuan ; Yi, Peng et al. / Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System. Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers. editor / Yuqing Sun ; Tun Lu ; Buqing Cao ; Hongfei Fan ; Dongning Liu ; Bowen Du ; Liping Gao. Springer Science and Business Media Deutschland GmbH, 2022. pp. 134-146 (Communications in Computer and Information Science).

@inproceedings{c5e1acdef66844e699fd67f55b7ce2da,

title = "Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System",

abstract = "Centralized training and decentralized execution have become a basic setting for multi-agent reinforcement learning. As the number of agents increases, the performance of the actors that only use their own local observations with centralized critics is prone to bottlenecks in complex scenarios. Recent research has shown that agents learn when to communicate to share information efficiently, that agents communicate with each other in a right time during the execution phase to complete the cooperation task. Therefore, in this paper, we proposed a model that learn when to communicate under the centralized critic supporting, so that the agent is able to adaptive control communication under the centralized critic learned by global environmental information. Experiments in a cooperation scenario demonstrate the advantages of model. With our proposed cooperation model, agents are able to block communication at an appropriate time under the centralized critic setting and cooperation with each other at the task.",

keywords = "Centralized critic, Communication, Cooperation, Multi-agent, Reinforcement learning",

author = "Qingshuang Sun and Yuan Yao and Peng Yi and Xingshe Zhou and Gang Yang",

note = "Publisher Copyright: {\textcopyright} 2022, Springer Nature Singapore Pte Ltd.; 16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021 ; Conference date: 26-11-2021 Through 28-11-2021",

year = "2022",

doi = "10.1007/978-981-19-4549-6_11",

language = "英语",

isbn = "9789811945489",

series = "Communications in Computer and Information Science",

publisher = "Springer Science and Business Media Deutschland GmbH",

pages = "134--146",

editor = "Yuqing Sun and Tun Lu and Buqing Cao and Hongfei Fan and Dongning Liu and Bowen Du and Liping Gao",

booktitle = "Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers",

}

Sun, Q, Yao, Y, Yi, P, Zhou, X & Yang, G 2022, Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System. in Y Sun, T Lu, B Cao, H Fan, D Liu, B Du & L Gao (eds), Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers. Communications in Computer and Information Science, vol. 1492 CCIS, Springer Science and Business Media Deutschland GmbH, pp. 134-146, 16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021, Virtual, Online, 26/11/21. https://doi.org/10.1007/978-981-19-4549-6_11

Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System. / Sun, Qingshuang; Yao, Yuan; Yi, Peng et al.
Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers. ed. / Yuqing Sun; Tun Lu; Buqing Cao; Hongfei Fan; Dongning Liu; Bowen Du; Liping Gao. Springer Science and Business Media Deutschland GmbH, 2022. p. 134-146 (Communications in Computer and Information Science; Vol. 1492 CCIS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System

AU - Sun, Qingshuang

AU - Yao, Yuan

AU - Yi, Peng

AU - Zhou, Xingshe

AU - Yang, Gang

PY - 2022

Y1 - 2022

N2 - Centralized training and decentralized execution have become a basic setting for multi-agent reinforcement learning. As the number of agents increases, the performance of the actors that only use their own local observations with centralized critics is prone to bottlenecks in complex scenarios. Recent research has shown that agents learn when to communicate to share information efficiently, that agents communicate with each other in a right time during the execution phase to complete the cooperation task. Therefore, in this paper, we proposed a model that learn when to communicate under the centralized critic supporting, so that the agent is able to adaptive control communication under the centralized critic learned by global environmental information. Experiments in a cooperation scenario demonstrate the advantages of model. With our proposed cooperation model, agents are able to block communication at an appropriate time under the centralized critic setting and cooperation with each other at the task.

AB - Centralized training and decentralized execution have become a basic setting for multi-agent reinforcement learning. As the number of agents increases, the performance of the actors that only use their own local observations with centralized critics is prone to bottlenecks in complex scenarios. Recent research has shown that agents learn when to communicate to share information efficiently, that agents communicate with each other in a right time during the execution phase to complete the cooperation task. Therefore, in this paper, we proposed a model that learn when to communicate under the centralized critic supporting, so that the agent is able to adaptive control communication under the centralized critic learned by global environmental information. Experiments in a cooperation scenario demonstrate the advantages of model. With our proposed cooperation model, agents are able to block communication at an appropriate time under the centralized critic setting and cooperation with each other at the task.

KW - Centralized critic

KW - Communication

KW - Cooperation

KW - Multi-agent

KW - Reinforcement learning

UR - http://www.scopus.com/inward/record.url?scp=85135074685&partnerID=8YFLogxK

U2 - 10.1007/978-981-19-4549-6_11

DO - 10.1007/978-981-19-4549-6_11

M3 - 会议稿件

AN - SCOPUS:85135074685

SN - 9789811945489

T3 - Communications in Computer and Information Science

SP - 134

EP - 146

BT - Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers

A2 - Sun, Yuqing

A2 - Lu, Tun

A2 - Cao, Buqing

A2 - Fan, Hongfei

A2 - Liu, Dongning

A2 - Du, Bowen

A2 - Gao, Liping

PB - Springer Science and Business Media Deutschland GmbH

T2 - 16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021

Y2 - 26 November 2021 through 28 November 2021

ER -

Sun Q, Yao Y, Yi P, Zhou X, Yang G. Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System. In Sun Y, Lu T, Cao B, Fan H, Liu D, Du B, Gao L, editors, Computer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers. Springer Science and Business Media Deutschland GmbH. 2022. p. 134-146. (Communications in Computer and Information Science). doi: 10.1007/978-981-19-4549-6_11