Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System

Qingshuang Sun, Yuan Yao, Peng Yi, Xingshe Zhou, Gang Yang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Centralized training and decentralized execution have become a basic setting for multi-agent reinforcement learning. As the number of agents increases, the performance of the actors that only use their own local observations with centralized critics is prone to bottlenecks in complex scenarios. Recent research has shown that agents learn when to communicate to share information efficiently, that agents communicate with each other in a right time during the execution phase to complete the cooperation task. Therefore, in this paper, we proposed a model that learn when to communicate under the centralized critic supporting, so that the agent is able to adaptive control communication under the centralized critic learned by global environmental information. Experiments in a cooperation scenario demonstrate the advantages of model. With our proposed cooperation model, agents are able to block communication at an appropriate time under the centralized critic setting and cooperation with each other at the task.

Original languageEnglish
Title of host publicationComputer Supported Cooperative Work and Social Computing - 16th CCF Conference, ChineseCSCW 2021, Revised Selected Papers
EditorsYuqing Sun, Tun Lu, Buqing Cao, Hongfei Fan, Dongning Liu, Bowen Du, Liping Gao
PublisherSpringer Science and Business Media Deutschland GmbH
Pages134-146
Number of pages13
ISBN (Print)9789811945489
DOIs
StatePublished - 2022
Event16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021 - Virtual, Online
Duration: 26 Nov 202128 Nov 2021

Publication series

NameCommunications in Computer and Information Science
Volume1492 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference16th CCF Conference on Computer Supported Cooperative Work and Social Computing, ChineseCSCW 2021
CityVirtual, Online
Period26/11/2128/11/21

Keywords

  • Centralized critic
  • Communication
  • Cooperation
  • Multi-agent
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Learning When to Communicate Among Actors with the Centralized Critic for the Multi-agent System'. Together they form a unique fingerprint.

Cite this