TY - GEN
T1 - Multi-robot Cooperative Navigation Method based on Multi-agent Reinforcement Learning in Sparse Reward Tasks
AU - Li, Kai
AU - Wang, Quanhu
AU - Gong, Mengyao
AU - Li, Jiahui
AU - Shi, Haobin
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Multi-robot systems can collaborate to accomplish more complex tasks than a single robot. Cooperative navigation is the basis for multi-robot systems to complete rescue, reconnaissance, and other tasks in high-risk areas in place of human beings. Multi-agent reinforcement learning (MARL) is the most effective method for controlling multi-robot cooperation, but the sparsity of rewards limits its application in real scenarios. In this paper, a curiosity-inspired MARL approach called CIMADDPG is proposed to promote robot exploration. A global curiosity allocation mechanism is designed to determine each agent's contribution to the global reward. In addition, to ensure that the collaboration of agents is not lost during exploration, a dual critic network is designed to jointly guide the update of the policy network. Finally, the performance of the proposed method is verified in a multi-agent particle environment (MPE) and a multi-robot (Turtlebot3) cooperative navigation simulation environment. The experimental results show that CIMADDPG improves on the performance of SOTA methods by 23.53% to 48.84% and achieves a high success rate in multi-robot cooperative navigation.
AB - Multi-robot systems can collaborate to accomplish more complex tasks than a single robot. Cooperative navigation is the basis for multi-robot systems to complete rescue, reconnaissance, and other tasks in high-risk areas in place of human beings. Multi-agent reinforcement learning (MARL) is the most effective method for controlling multi-robot cooperation, but the sparsity of rewards limits its application in real scenarios. In this paper, a curiosity-inspired MARL approach called CIMADDPG is proposed to promote robot exploration. A global curiosity allocation mechanism is designed to determine each agent's contribution to the global reward. In addition, to ensure that the collaboration of agents is not lost during exploration, a dual critic network is designed to jointly guide the update of the policy network. Finally, the performance of the proposed method is verified in a multi-agent particle environment (MPE) and a multi-robot (Turtlebot3) cooperative navigation simulation environment. The experimental results show that CIMADDPG improves on the performance of SOTA methods by 23.53% to 48.84% and achieves a high success rate in multi-robot cooperative navigation.
KW - collaborative navigation
KW - deep reinforcement learning
KW - multi-robot
KW - multi-agent reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85175613071&partnerID=8YFLogxK
U2 - 10.1109/ISCEIC59030.2023.10271221
DO - 10.1109/ISCEIC59030.2023.10271221
M3 - Conference contribution
AN - SCOPUS:85175613071
T3 - 2023 4th International Symposium on Computer Engineering and Intelligent Communications, ISCEIC 2023
SP - 257
EP - 261
BT - 2023 4th International Symposium on Computer Engineering and Intelligent Communications, ISCEIC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 4th International Symposium on Computer Engineering and Intelligent Communications, ISCEIC 2023
Y2 - 18 August 2023 through 20 August 2023
ER -