多智能体编队控制中的迁移强化学习算法研究

Penglin Hu, Quan Pan, Yaning Guo, Chunhui Zhao

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

Considering the obstacle avoidance and collision avoidance for multi-agent cooperative formation in multi-obstacle environment, a formation control algorithm based on transfer learning and reinforcement learning is proposed. Firstly, in the source task learning stage, the large storage space required by Q-table solution is avoided by using the value function approximation method, which effectively reduces the storage space requirement and im- proves the solving speed of the algorithm. Secondly, in the learning phase of the target task, Gaussian clustering al- gorithm was used to classify the source tasks. According to the distance between the clustering center and the target task, the optimal source task class was selected for target task learning, which effectively avoided the negative transfer phenomenon, and improved the generalization ability and convergence speed of reinforcement learning algo- rithm. Finally, the simulation results show that this method can effectively form and maintain formation configuration of multi-agent system in complex environment with obstacles, and realize obstacle avoidance and collision avoidance at the same time.

投稿的翻译标题Study on learning algorithm of transfer reinforcement for multi-agent formation control
源语言繁体中文
页(从-至)389-399
页数11
期刊Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University
41
2
DOI
出版状态已出版 - 4月 2023

关键词

  • formation control
  • Gaussian clustering
  • multi-agent system
  • transfer reinforcement learning
  • value function approximation

指纹

探究 '多智能体编队控制中的迁移强化学习算法研究' 的科研主题。它们共同构成独一无二的指纹。

引用此