Skip to main navigation Skip to search Skip to main content

Dual-Centralized Q-Network-Based Reinforcement Learning for Cooperative Path Planning of Multiple UAVs

  • Jinchao Chen
  • , Chongde Ren
  • , Yujiao Hu
  • , Ying Zhang
  • , Yantao Lu
  • , Qing Li
  • , Tao You
  • , Joel J.P.C. Rodrigues
  • Northwestern Polytechnical University Xian
  • Purple Mountain Laboratories for Network and Communication Security
  • Lusófona University

Research output: Contribution to journalArticlepeer-review

14 Scopus citations

Abstract

Due to the low cost and high maneuverability, unmanned aerial vehicles (UAVs) have been commonly used and played an important role in both the civilian and military fields. Although UAVs can significantly achieve enhanced flexibility and extensibility for large-scale intelligent systems, they result in a serious path planning problem. Especially in complex environments with a large number of irregular obstacles, UAVs have to efficiently find near-optimization flight paths and automatically move to target positions to finish the group task while avoiding collisions and satisfying various constraints. In this work, we focus on the cooperative path planning problem of homogeneous UAVs and present a multi-agent reinforcement learning-based approach to solve the problem. First, with the UAV and obstacle models, we analyse the collision avoidance, motion continuity, and energy consumption constraints in UAV flying, and formulate the cooperative path planning problem as a multi-constraint combinatorial optimization one with a high computational complexity. Then, inspired by the twin delayed deep deterministic policy gradient algorithm where clipped dual Q-networks are used to decrease the overestimation error of critic networks, we propose a multi-agent reinforcement learning-based approach with a dual-centralized Q-network mechanism to automatically produce feasible and collision-free flight path for each UAV. Finally, simulation experiments are conducted in a multi-agent particle environment to evaluate the effectiveness and efficiency of the proposed approach.

Original languageEnglish
Pages (from-to)13232-13246
Number of pages15
JournalIEEE Transactions on Intelligent Transportation Systems
Volume26
Issue number9
DOIs
StatePublished - 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Path planning
  • dual-centralized Q-network
  • multi-agent reinforcement learning
  • unmanned aerial vehicle

Fingerprint

Dive into the research topics of 'Dual-Centralized Q-Network-Based Reinforcement Learning for Cooperative Path Planning of Multiple UAVs'. Together they form a unique fingerprint.

Cite this