TY - GEN
T1 - Progressive Prioritized Experience Replay for Multi-Agent Reinforcement Learning
AU - Chen, Zhuoying
AU - Li, Huiping
AU - Wang, Rizhong
AU - Cui, Di
N1 - Publisher Copyright:
© 2024 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2024
Y1 - 2024
N2 - Due to limitations in payload, perception ability, and communication range, a single agent struggles to meet increasingly complex task requirements. As a result, multi-agent reinforcement learning algorithms have attracted growing attention. However, algorithm convergence becomes more difficult as the number of agents increases. In this article, an efficient training framework called Progressive Prioritized Experience Replay (PPER) is proposed to address this problem. PPER decomposes the task scene into several similar sub-scenes whose complexity ranges from easy to difficult. A progressive training (PT) approach lets the agents accumulate learning experience in the sub-scenes before entering the full task scene, which greatly reduces the training difficulty. To verify the effectiveness of the proposed training framework, we extended OpenAI Gym to create a multi-USV confrontation environment, where comparative tests demonstrate the superior performance of PPER.
KW - Multi-USV
KW - PPER
KW - Progressive Training
UR - http://www.scopus.com/inward/record.url?scp=85205494756&partnerID=8YFLogxK
U2 - 10.23919/CCC63176.2024.10661678
DO - 10.23919/CCC63176.2024.10661678
M3 - Conference contribution
AN - SCOPUS:85205494756
T3 - Chinese Control Conference, CCC
SP - 8292
EP - 8296
BT - Proceedings of the 43rd Chinese Control Conference, CCC 2024
A2 - Na, Jing
A2 - Sun, Jian
PB - IEEE Computer Society
T2 - 43rd Chinese Control Conference, CCC 2024
Y2 - 28 July 2024 through 31 July 2024
ER -