Abstract
Efficient exploration in cooperative multi-agent reinforcement learning remains challenging in complex tasks. In this paper, we propose a novel multi-agent collaborative exploration method called Neighborhood Curiosity-based Exploration (NCE), by which agents can explore not only novel states but also new partnerships. Concretely, we use the attention mechanism in graph convolutional networks to perform a weighted summation of the features of each agent's neighbors. The calculated attention weights can be regarded as an embodiment of the relationships among agents. We then use the prediction errors of the aggregated features as intrinsic rewards to facilitate exploration. When agents encounter novel states or new partnerships, NCE produces large prediction errors and hence large intrinsic rewards. Moreover, in multi-agent systems agents are influenced most strongly by their neighbors and interact directly only with them, so exploring the partnerships between agents and their neighbors enables agents to capture their most important cooperative relations. As a result, NCE can effectively promote collaborative exploration even in environments with a large number of agents. Our experimental results show that NCE achieves significant performance improvements on the challenging StarCraft Multi-Agent Challenge (SMAC) benchmark.
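The abstract describes the mechanism only at a high level; below is a minimal sketch of how such a module could look in PyTorch. Everything in it is an illustrative assumption rather than the paper's exact design: the class name `NeighborhoodCuriosity`, the dot-product (GAT-style) attention over neighbors, and the use of a fixed random target network with a trainable predictor (RND-style) to turn prediction error into an intrinsic reward.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class NeighborhoodCuriosity(nn.Module):
    """Hypothetical sketch of an NCE-style intrinsic reward.

    1. Aggregate each agent's neighbor features with attention weights
       (the weights act as a soft encoding of partnerships).
    2. Score novelty as the error of a predictor trying to match a fixed
       random target network on the aggregated features (RND-style).
    """

    def __init__(self, feat_dim: int, embed_dim: int = 64):
        super().__init__()
        # Projections for attention over neighbors.
        self.query = nn.Linear(feat_dim, embed_dim, bias=False)
        self.key = nn.Linear(feat_dim, embed_dim, bias=False)
        self.value = nn.Linear(feat_dim, embed_dim, bias=False)
        # Fixed random target network (frozen) and trainable predictor.
        self.target = nn.Linear(embed_dim, embed_dim)
        for p in self.target.parameters():
            p.requires_grad = False
        self.predictor = nn.Sequential(
            nn.Linear(embed_dim, embed_dim), nn.ReLU(),
            nn.Linear(embed_dim, embed_dim),
        )

    def forward(self, feats: torch.Tensor, adj: torch.Tensor):
        # feats: [n_agents, feat_dim]
        # adj:   [n_agents, n_agents] 0/1 neighborhood mask, assumed to
        #        include self-loops so every row has at least one entry.
        q, k, v = self.query(feats), self.key(feats), self.value(feats)
        scores = q @ k.t() / k.shape[-1] ** 0.5           # pairwise scores
        scores = scores.masked_fill(adj == 0, float("-inf"))
        attn = torch.softmax(scores, dim=-1)              # partnership weights
        aggregated = attn @ v                             # weighted summation
        # Intrinsic reward = prediction error on the aggregated features.
        # Novel states and shifted attention patterns both change
        # `aggregated`, so both yield large errors.
        err = F.mse_loss(self.predictor(aggregated),
                         self.target(aggregated).detach(),
                         reduction="none").mean(dim=-1)   # per-agent reward
        return err, attn
```

In use, the per-agent error would presumably be added to the environment reward with a scaling coefficient while the predictor is trained to minimize that same error on collected batches; the abstract does not specify these training details, so they are assumptions here.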
Original language | English |
---|---|
Journal | IEEE Transactions on Cognitive and Developmental Systems |
DOIs | |
State | Accepted/In press - 2024 |
Keywords
- Machine learning
- Multi-agent reinforcement learning
- Multi-agent system