Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments

Dingwei Wu; Kaifang Wan; Xiaoguang Gao; Zijian Hu

doi:10.1109/ICCRE51898.2021.9435656

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

8 Scopus citations

Abstract

When agents in a multiagent system implement motion planning in complex and dynamic environments, model-based planning algorithms have poor adaptability, while intelligent algorithms, such as MADDPG, encounter difficulty in converging when training multiple agents, and the resulting control model has poor stability and robustness. To address the above challenges, this paper proposes a mixed experience multiagent deep deterministic policy gradient algorithm referred to as ME-MADDPG. The algorithm increases the high-quality experience obtained by artificial potential field method and uses dynamic probability to sample from different replay buffers. Simulation experiments have proven that compared to MADDPG, ME-MADDPG greatly improves convergence speed, convergence effect and stability and that ME-MADDPG can efficiently provide shorter and more convenient paths for multiagent systems.
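The abstract mentions sampling from different replay buffers with a dynamic probability but gives no further detail here. The following is only a minimal sketch of that general idea, not the authors' implementation. The class name `MixedReplay`, the linear decay schedule, and the parameters `p_start`, `p_end`, and `decay_steps` are all assumptions made for illustration. It keeps one buffer for transitions from a guide (for example, an artificial potential field controller) and one for transitions from the learned policy, and shifts sampling from the first to the second as training proceeds.

```python
import random
from collections import deque


class MixedReplay:
    """Two replay buffers sampled with a time-varying probability.

    Illustrative only: the linear schedule and all parameter names are
    assumptions, not taken from the paper.
    """

    def __init__(self, capacity=10000, p_start=0.8, p_end=0.0,
                 decay_steps=1000, seed=None):
        self.guided = deque(maxlen=capacity)    # transitions from a guide such as APF
        self.explored = deque(maxlen=capacity)  # transitions from the learned policy
        self.p_start = p_start
        self.p_end = p_end
        self.decay_steps = decay_steps
        self.rng = random.Random(seed)

    def guided_probability(self, step):
        """Probability of drawing from the guided buffer at a training step."""
        frac = min(max(step / self.decay_steps, 0.0), 1.0)
        return self.p_start + frac * (self.p_end - self.p_start)

    def add(self, transition, from_guide):
        """Store a transition in the buffer matching its origin."""
        (self.guided if from_guide else self.explored).append(transition)

    def sample(self, batch_size, step):
        """Draw a batch (with replacement), choosing a buffer per item."""
        p = self.guided_probability(step)
        batch = []
        for _ in range(batch_size):
            use_guided = self.rng.random() < p
            if use_guided:
                primary, fallback = self.guided, self.explored
            else:
                primary, fallback = self.explored, self.guided
            # Fall back to the other buffer if the chosen one is still empty.
            source = primary if primary else fallback
            if not source:
                raise ValueError("both buffers are empty")
            batch.append(self.rng.choice(source))
        return batch
```

In this sketch, early batches are dominated by guided transitions and later batches by the policy's own experience; any other schedule (for instance, one driven by recent reward) could be substituted in `guided_probability`.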

Original language	English
Title of host publication	2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	123-128
Number of pages	6
ISBN (Electronic)	9780738126128
DOIs	https://doi.org/10.1109/ICCRE51898.2021.9435656
State	Published - 16 Apr 2021
Event	6th International Conference on Control and Robotics Engineering, ICCRE 2021 - Virtual, Beijing, China Duration: 16 Apr 2021 → 18 Apr 2021

Publication series

Name	2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021

Conference

Conference	6th International Conference on Control and Robotics Engineering, ICCRE 2021
Country/Territory	China
City	Virtual, Beijing
Period	16/04/21 → 18/04/21

Keywords

deep reinforcement learning
MADDPG
motion planning
multiagent

Access to Document

10.1109/ICCRE51898.2021.9435656

Cite this

Wu, D., Wan, K., Gao, X., & Hu, Z. (2021). Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments. In 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021 (pp. 123-128). Article 9435656 (2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCRE51898.2021.9435656

Wu, Dingwei ; Wan, Kaifang ; Gao, Xiaoguang et al. / Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments. 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021. Institute of Electrical and Electronics Engineers Inc., 2021. pp. 123-128 (2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021).

@inproceedings{c971c4b4b77445aca6b0906faf0554eb,

title = "Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments",

abstract = "When agents in a multiagent system implement motion planning in complex and dynamic environments, model-based planning algorithms have poor adaptability, while intelligent algorithms, such as MADDPG, encounter difficulty in converging when training multiple agents, and the resulting control model has poor stability and robustness. To address the above challenges, this paper proposes a mixed experience multiagent deep deterministic policy gradient algorithm referred to as ME-MADDPG. The algorithm increases the high-quality experience obtained by artificial potential field method and uses dynamic probability to sample from different replay buffers. Simulation experiments have proven that compared to MADDPG, ME-MADDPG greatly improves convergence speed, convergence effect and stability and that ME-MADDPG can efficiently provide shorter and more convenient paths for multiagent systems.",

keywords = "deep reinforcement learning, MADDPG, motion planning, multiagent",

author = "Dingwei Wu and Kaifang Wan and Xiaoguang Gao and Zijian Hu",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE.; 6th International Conference on Control and Robotics Engineering, ICCRE 2021 ; Conference date: 16-04-2021 Through 18-04-2021",

year = "2021",

month = apr,

day = "16",

doi = "10.1109/ICCRE51898.2021.9435656",

language = "English",

series = "2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "123--128",

booktitle = "2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021",

}

Wu, D, Wan, K, Gao, X & Hu, Z 2021, Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments. in 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021., 9435656, 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021, Institute of Electrical and Electronics Engineers Inc., pp. 123-128, 6th International Conference on Control and Robotics Engineering, ICCRE 2021, Virtual, Beijing, China, 16/04/21. https://doi.org/10.1109/ICCRE51898.2021.9435656

Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments. / Wu, Dingwei; Wan, Kaifang; Gao, Xiaoguang et al.
2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021. Institute of Electrical and Electronics Engineers Inc., 2021. p. 123-128 9435656 (2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021).


TY - GEN

T1 - Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments

AU - Wu, Dingwei

AU - Wan, Kaifang

AU - Gao, Xiaoguang

AU - Hu, Zijian

PY - 2021/4/16

Y1 - 2021/4/16

N2 - When agents in a multiagent system implement motion planning in complex and dynamic environments, model-based planning algorithms have poor adaptability, while intelligent algorithms, such as MADDPG, encounter difficulty in converging when training multiple agents, and the resulting control model has poor stability and robustness. To address the above challenges, this paper proposes a mixed experience multiagent deep deterministic policy gradient algorithm referred to as ME-MADDPG. The algorithm increases the high-quality experience obtained by artificial potential field method and uses dynamic probability to sample from different replay buffers. Simulation experiments have proven that compared to MADDPG, ME-MADDPG greatly improves convergence speed, convergence effect and stability and that ME-MADDPG can efficiently provide shorter and more convenient paths for multiagent systems.

AB - When agents in a multiagent system implement motion planning in complex and dynamic environments, model-based planning algorithms have poor adaptability, while intelligent algorithms, such as MADDPG, encounter difficulty in converging when training multiple agents, and the resulting control model has poor stability and robustness. To address the above challenges, this paper proposes a mixed experience multiagent deep deterministic policy gradient algorithm referred to as ME-MADDPG. The algorithm increases the high-quality experience obtained by artificial potential field method and uses dynamic probability to sample from different replay buffers. Simulation experiments have proven that compared to MADDPG, ME-MADDPG greatly improves convergence speed, convergence effect and stability and that ME-MADDPG can efficiently provide shorter and more convenient paths for multiagent systems.

KW - deep reinforcement learning

KW - MADDPG

KW - motion planning

KW - multiagent

UR - http://www.scopus.com/inward/record.url?scp=85107775603&partnerID=8YFLogxK

U2 - 10.1109/ICCRE51898.2021.9435656

DO - 10.1109/ICCRE51898.2021.9435656

M3 - Conference contribution

AN - SCOPUS:85107775603

T3 - 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021

SP - 123

EP - 128

BT - 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 6th International Conference on Control and Robotics Engineering, ICCRE 2021

Y2 - 16 April 2021 through 18 April 2021

ER -

Wu D, Wan K, Gao X, Hu Z. Multiagent Motion Planning Based on Deep Reinforcement Learning in Complex Environments. In 2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021. Institute of Electrical and Electronics Engineers Inc. 2021. p. 123-128. 9435656. (2021 6th International Conference on Control and Robotics Engineering, ICCRE 2021). doi: 10.1109/ICCRE51898.2021.9435656
