跳到主要导航 跳到搜索 跳到主要内容

Random Latent Exploration-Based Multi-Agent Reinforcement Learning for Multi-UAV Path Planning

  • Jinchao Chen
  • , Yuwei Li
  • , Qing Zhou
  • , Zhaohui Liu
  • , Qinwei Zhang
  • , Weihua Liang
  • Northwestern Polytechnical University Xian
  • National Key Laboratory of Avionics Integration and Aviation System-of-Systems Synthesis
  • Shanghai Jiao Tong University
  • North Automatic Control Technology Institute

科研成果: 期刊稿件文章同行评审

摘要

Unmanned aerial vehicles (UAVs) have become increasingly popular in civilian and military applications due to their low cost and high maneuverability. Multi-UAV path planning is a highly challenging problem with an NP-hard complexity, and frequently solved by machine learning algorithms, especially in dynamic and open environments. However, the sparse-reward problem in traditional machine learning algorithms often makes solutions trapped in local optima and results in unsatisfactory f light paths, seriously affecting the coordination effect of UAVs. In this paper, we propose an enhanced random latent exploration based multi-agent reinforcement learning framework to provide a reasonable flight path for each UAV and efficiently achieve the group missions. First, we analyse the constraints and objectives of the multi-UAV path planning problem and abstract it as a multi-constraint combinatorial optimization one. Then, inspired by exploration bonus and randomized value function mechanisms, we propose a random latent exploration-based multi-agent reinforcement learning framework to enable UAVs to acquire more diverse rewards in path planning. Simulation experiments in a multi-agent particle environment are conducted to verify the effectiveness and efficiency of the proposed approach. Compared with baselines, our approach shows significant advantages and achieves a 71.2% increase in average reward, a 46.5% improvement in success rate, a 26% reduction in flight time, and a 33.6% reduction in path length.

源语言英语
期刊IEEE Transactions on Vehicular Technology
DOI
出版状态已接受/待刊 - 2026

指纹

探究 'Random Latent Exploration-Based Multi-Agent Reinforcement Learning for Multi-UAV Path Planning' 的科研主题。它们共同构成独一无二的指纹。

引用此