Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning

Tong Li, Chenjia Bai, Kang Xu, Chen Chu, Peican Zhu, Zhen Wang

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

With the popularization of intelligence, the necessity of cooperation between intelligent machines makes the research of collaborative multi-agent reinforcement learning (MARL) more extensive. Existing approaches typically address this challenge through task decomposition of the environment or role classification of agents. However, these studies may rely on the sharing of parameters between agents, resulting in the homogeneity of agent behavior, which is not effective for complex tasks. Or training that relies on external rewards is difficult to adapt to scenarios with sparse rewards. Based on the above challenges, in this paper we propose a novel dynamic skill learning (DSL) framework for agents to learn more diverse abilities motivated by internal rewards. Specifically, the DSL has two components: (i) Dynamic skill discovery, which encourages the production of meaningful skills by exploring the environment in an unsupervised manner, using the inner product between a skill vector and a trajectory representation to generate intrinsic rewards. Meanwhile, the Lipschitz constraint of the state representation function is used to ensure the proper trajectory of the learned skills. (ii) Dynamic skill assignment, which utilizes a policy controller to assign skills to each agent based on its different trajectory latent variables. In addition, in order to avoid training instability caused by frequent changes in skill selection, we introduce a regularization term to limit skill switching between adjacent time steps. We thoroughly tested the DSL approach on two challenging benchmarks, StarCraft II and Google Research Football. Experimental results show that compared with strong benchmarks such as QMIX and RODE, DSL effectively improves performance and is more adaptable to difficult collaborative scenarios.

Original languageEnglish
Article number106852
JournalNeural Networks
Volume181
DOIs
StatePublished - Jan 2025

Keywords

  • Diverse behaviors
  • Multi-agent reinforcement learning
  • Skill assignment
  • Skill discovery

Fingerprint

Dive into the research topics of 'Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning'. Together they form a unique fingerprint.

Cite this