Progressive Prioritized Experience Replay for Multi-Agent Reinforcement Learning

Zhuoying Chen, Huiping Li, Rizhong Wang, Di Cui

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

Due to limitations in payload, perception capability, and communication range, a single agent can hardly meet increasingly complex task requirements, so multi-agent reinforcement learning has attracted growing attention. However, algorithm convergence becomes more difficult as the number of agents increases. In this article, an efficient training framework called Progressive Prioritized Experience Replay (PPER) is proposed to address this problem. PPER decomposes the task scene into several similar sub-scenes whose complexity ranges from easy to difficult. A progressive training (PT) approach lets the agents accumulate learning experience in the sub-scenes before tackling the full task scene, which greatly reduces the training difficulty. To verify the effectiveness of the framework, we extended OpenAI Gym to create a multi-USV confrontation environment, and comparative tests demonstrate the superior performance of PPER.
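The progressive scheme described in the abstract can be sketched as a curriculum loop over sub-scenes, ordered easy to difficult, feeding a shared prioritized replay buffer. This is a minimal illustrative sketch only: the paper's exact priority rule, sub-scene construction, and agent update are not given here, and all names below (`PrioritizedReplayBuffer`, `progressive_training`) are hypothetical.

```python
import random
import numpy as np

class PrioritizedReplayBuffer:
    """Simplified proportional prioritized replay (illustrative, not the paper's exact buffer)."""
    def __init__(self, capacity, alpha=0.6):
        self.capacity = capacity
        self.alpha = alpha          # how strongly priorities skew sampling
        self.buffer = []
        self.priorities = []
        self.pos = 0                # ring-buffer write position

    def add(self, transition, priority=1.0):
        p = priority ** self.alpha
        if len(self.buffer) < self.capacity:
            self.buffer.append(transition)
            self.priorities.append(p)
        else:
            self.buffer[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        probs = np.array(self.priorities)
        probs = probs / probs.sum()
        idx = np.random.choice(len(self.buffer), size=batch_size, p=probs)
        return [self.buffer[i] for i in idx]

def progressive_training(sub_scenes, episodes_per_scene, buffer):
    """Train through sub-scenes ordered easy -> difficult, sharing one replay buffer
    so experience from easier sub-scenes carries over to harder ones."""
    visited = []
    for scene in sub_scenes:                     # curriculum: easiest scene first
        for _ in range(episodes_per_scene):
            # Placeholder rollout: real code would step the environment and
            # compute a TD-error-based priority for each stored transition.
            transition = (scene, random.random())
            buffer.add(transition, priority=abs(transition[1]) + 1e-3)
        visited.append(scene)
    return visited
```

A training run would then pass, for example, `["1v1", "2v2", "full_scene"]` as the sub-scene curriculum; the key design choice sketched here is that the buffer persists across sub-scenes, so the agents enter the full task scene with experience already accumulated.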

Original language: English
Title of host publication: Proceedings of the 43rd Chinese Control Conference, CCC 2024
Editors: Jing Na, Jian Sun
Publisher: IEEE Computer Society
Pages: 8292-8296
Number of pages: 5
ISBN (Electronic): 9789887581581
DOI
Publication status: Published - 2024
Event: 43rd Chinese Control Conference, CCC 2024 - Kunming, China
Duration: 28 Jul 2024 – 31 Jul 2024

Publication series

Name: Chinese Control Conference, CCC
ISSN (Print): 1934-1768
ISSN (Electronic): 2161-2927

Conference

Conference: 43rd Chinese Control Conference, CCC 2024
Country/Territory: China
City: Kunming
Period: 28/07/24 – 31/07/24
