多智能体编队控制中的迁移强化学习算法研究

Penglin Hu; Quan Pan; Yaning Guo; Chunhui Zhao

doi:10.1051/jnwpu/20234120389

多智能体编队控制中的迁移强化学习算法研究

Penglin Hu, Quan Pan, Yaning Guo, Chunhui Zhao

自动化学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

3 引用（Scopus）

摘要

Considering the obstacle avoidance and collision avoidance for multi-agent cooperative formation in multi-obstacle environment, a formation control algorithm based on transfer learning and reinforcement learning is proposed. Firstly, in the source task learning stage, the large storage space required by Q-table solution is avoided by using the value function approximation method, which effectively reduces the storage space requirement and im- proves the solving speed of the algorithm. Secondly, in the learning phase of the target task, Gaussian clustering al- gorithm was used to classify the source tasks. According to the distance between the clustering center and the target task, the optimal source task class was selected for target task learning, which effectively avoided the negative transfer phenomenon, and improved the generalization ability and convergence speed of reinforcement learning algo- rithm. Finally, the simulation results show that this method can effectively form and maintain formation configuration of multi-agent system in complex environment with obstacles, and realize obstacle avoidance and collision avoidance at the same time.

投稿的翻译标题	Study on learning algorithm of transfer reinforcement for multi-agent formation control
源语言	繁体中文
页（从-至）	389-399
页数	11
期刊	Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
卷	41
期	2
DOI	https://doi.org/10.1051/jnwpu/20234120389
出版状态	已出版 - 4月 2023

关键词

formation control
Gaussian clustering
multi-agent system
transfer reinforcement learning
value function approximation

访问文件

10.1051/jnwpu/20234120389

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{442a0962853b4e2b945a92e13c414daa,

title = "多智能体编队控制中的迁移强化学习算法研究",

abstract = "Considering the obstacle avoidance and collision avoidance for multi-agent cooperative formation in multi-obstacle environment, a formation control algorithm based on transfer learning and reinforcement learning is proposed. Firstly, in the source task learning stage, the large storage space required by Q-table solution is avoided by using the value function approximation method, which effectively reduces the storage space requirement and im- proves the solving speed of the algorithm. Secondly, in the learning phase of the target task, Gaussian clustering al- gorithm was used to classify the source tasks. According to the distance between the clustering center and the target task, the optimal source task class was selected for target task learning, which effectively avoided the negative transfer phenomenon, and improved the generalization ability and convergence speed of reinforcement learning algo- rithm. Finally, the simulation results show that this method can effectively form and maintain formation configuration of multi-agent system in complex environment with obstacles, and realize obstacle avoidance and collision avoidance at the same time.",

keywords = "formation control, Gaussian clustering, multi-agent system, transfer reinforcement learning, value function approximation",

author = "Penglin Hu and Quan Pan and Yaning Guo and Chunhui Zhao",

note = "Publisher Copyright: {\textcopyright}2023 Journal of Northwestern Polytechnical University.",

year = "2023",

month = apr,

doi = "10.1051/jnwpu/20234120389",

language = "繁体中文",

volume = "41",

pages = "389--399",

journal = "Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University",

issn = "1000-2758",

publisher = "Northwestern Polytechnical University",

number = "2",

}

TY - JOUR

T1 - 多智能体编队控制中的迁移强化学习算法研究

AU - Hu, Penglin

AU - Pan, Quan

AU - Guo, Yaning

AU - Zhao, Chunhui

PY - 2023/4

Y1 - 2023/4

N2 - Considering the obstacle avoidance and collision avoidance for multi-agent cooperative formation in multi-obstacle environment, a formation control algorithm based on transfer learning and reinforcement learning is proposed. Firstly, in the source task learning stage, the large storage space required by Q-table solution is avoided by using the value function approximation method, which effectively reduces the storage space requirement and im- proves the solving speed of the algorithm. Secondly, in the learning phase of the target task, Gaussian clustering al- gorithm was used to classify the source tasks. According to the distance between the clustering center and the target task, the optimal source task class was selected for target task learning, which effectively avoided the negative transfer phenomenon, and improved the generalization ability and convergence speed of reinforcement learning algo- rithm. Finally, the simulation results show that this method can effectively form and maintain formation configuration of multi-agent system in complex environment with obstacles, and realize obstacle avoidance and collision avoidance at the same time.

AB - Considering the obstacle avoidance and collision avoidance for multi-agent cooperative formation in multi-obstacle environment, a formation control algorithm based on transfer learning and reinforcement learning is proposed. Firstly, in the source task learning stage, the large storage space required by Q-table solution is avoided by using the value function approximation method, which effectively reduces the storage space requirement and im- proves the solving speed of the algorithm. Secondly, in the learning phase of the target task, Gaussian clustering al- gorithm was used to classify the source tasks. According to the distance between the clustering center and the target task, the optimal source task class was selected for target task learning, which effectively avoided the negative transfer phenomenon, and improved the generalization ability and convergence speed of reinforcement learning algo- rithm. Finally, the simulation results show that this method can effectively form and maintain formation configuration of multi-agent system in complex environment with obstacles, and realize obstacle avoidance and collision avoidance at the same time.

KW - formation control

KW - Gaussian clustering

KW - multi-agent system

KW - transfer reinforcement learning

KW - value function approximation

UR - http://www.scopus.com/inward/record.url?scp=85162921022&partnerID=8YFLogxK

U2 - 10.1051/jnwpu/20234120389

DO - 10.1051/jnwpu/20234120389

M3 - 文章

AN - SCOPUS:85162921022

SN - 1000-2758

VL - 41

SP - 389

EP - 399

JO - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University

JF - Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University

IS - 2

ER -

多智能体编队控制中的迁移强化学习算法研究

摘要

关键词

访问文件

其它文件与链接

指纹

引用此