TY - JOUR
T1 - Neighborhood-Curiosity-Based Exploration in Multiagent Reinforcement Learning
AU - Yang, Shike
AU - He, Ziming
AU - Li, Jingchen
AU - Shi, Haobin
AU - Ji, Qingbing
AU - Hwang, Kao-Shing
AU - Li, Xianshan
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2025
Y1 - 2025
N2 - Efficient exploration in cooperative multiagent reinforcement learning remains challenging in complex tasks. In this article, we propose a novel multiagent collaborative exploration method called neighborhood-curiosity-based exploration (NCE), with which agents explore not only novel states but also new partnerships. Concretely, we use the attention mechanism in graph convolutional networks to perform a weighted summation of features from neighbors. The resulting attention weights can be regarded as an embodiment of the relationships among agents. We then use the prediction errors of the aggregated features as intrinsic rewards to facilitate exploration. When agents encounter novel states or new partnerships, NCE produces large prediction errors and, in turn, large intrinsic rewards. Moreover, in multiagent systems, agents are influenced most strongly by their neighbors and interact directly only with them. Exploring partnerships with neighbors therefore enables agents to capture their most important cooperative relations with other agents. As a result, NCE effectively promotes collaborative exploration even in environments with many agents. Our experimental results show that NCE achieves significant performance improvements on the challenging StarCraft Multi-Agent Challenge (SMAC) micromanagement benchmark.
KW - multiagent reinforcement learning (MARL)
KW - multiagent system
UR - http://www.scopus.com/inward/record.url?scp=105003135570&partnerID=8YFLogxK
U2 - 10.1109/TCDS.2024.3460368
DO - 10.1109/TCDS.2024.3460368
M3 - Article
AN - SCOPUS:105003135570
SN - 2379-8920
VL - 17
SP - 379
EP - 389
JO - IEEE Transactions on Cognitive and Developmental Systems
JF - IEEE Transactions on Cognitive and Developmental Systems
IS - 2
ER -