Multi-UAV collaborative path planning based on the MASAC reinforcement learning algorithm (基于 MASAC 强化学习算法的多无人机协同路径规划)

Chengliang Fang; Feisheng Yang; Quan Pan

doi:10.1360/SSI-2024-0050

Translated title of the contribution: Multi-UAV collaborative path planning based on multi-agent soft actor critic

School of Automation

Northwestern Polytechnical University, Xi'an

Research output: Contribution to journal › Article › peer-review

1 Scopus citation

Abstract

This paper proposes a novel multi-agent deep reinforcement learning algorithm for the collaborative path planning problem of heterogeneous unmanned aerial vehicles (UAVs) in a dynamic, uncertain environment. First, a reinforcement learning environment is developed in which UAVs must reach a target location in an airspace scenario; the environment incorporates the UAV dynamics equations and accounts for UAV heterogeneity and the requirement for safe obstacle avoidance. Second, evaluation metrics covering task completion rate, formation maintenance rate, flight time, flight trajectory, and energy consumption are designed to assess algorithm performance. Then, the multi-UAV collaborative path planning problem is modeled as a partially observable Markov decision process, and a multi-agent soft actor-critic algorithm is proposed to find an approximately optimal policy for it. Finally, simulations demonstrate the effectiveness and superiority of the proposed algorithm.

Translated title of the contribution	Multi-UAV collaborative path planning based on multi-agent soft actor critic
Original language	Chinese (Traditional)
Pages (from-to)	1871-1883
Number of pages	13
Journal	Scientia Sinica Informationis
Volume	54
Issue number	8
DOIs	https://doi.org/10.1360/SSI-2024-0050
State	Published - 2024

Cite this

@article{39caa522d637432294b5997b48334d63,

title = "基于 MASAC 强化学习算法的多无人机协同路径规划",

abstract = "This paper proposes a novel multi-agent deep reinforcement learning algorithm for the collaborative path planning problem of heterogeneous unmanned aerial vehicles (UAVs) in a dynamic uncertain environment. Firstly, a reinforcement learning environment for UAVs is developed to reach a target location in an airspace scenario, where the environment introduces the UAV dynamics equations and considers the UAV heterogeneity as well as the requirement for safe obstacle avoidance. Secondly, evaluation metrics including task completion rate, formation maintenance rate, flight time, flight trajectory, and energy consumption are designed to evaluate the algorithm performance. Then, the multi-UAV collaborative path planning problem is modeled as a partially observable Markov decision process and a multi-agent soft actor critic algorithm is proposed to seek the approximate optimal strategy for the problem. Finally, the effectiveness and superiority of the proposed algorithm are demonstrated through simulations.",

keywords = "multi-agent deep reinforcement learning, multi-agent soft actor critic algorithm, multi-UAV, partially observable Markov decision process, path planning",

author = "Chengliang Fang and Feisheng Yang and Quan Pan",

year = "2024",

doi = "10.1360/SSI-2024-0050",

language = "Traditional Chinese",

volume = "54",

pages = "1871--1883",

journal = "Scientia Sinica Informationis",

issn = "1674-7267",

publisher = "Science Press",

number = "8",

}

TY - JOUR

T1 - 基于 MASAC 强化学习算法的多无人机协同路径规划

AU - Fang, Chengliang

AU - Yang, Feisheng

AU - Pan, Quan

PY - 2024

Y1 - 2024

AB - This paper proposes a novel multi-agent deep reinforcement learning algorithm for the collaborative path planning problem of heterogeneous unmanned aerial vehicles (UAVs) in a dynamic uncertain environment. Firstly, a reinforcement learning environment for UAVs is developed to reach a target location in an airspace scenario, where the environment introduces the UAV dynamics equations and considers the UAV heterogeneity as well as the requirement for safe obstacle avoidance. Secondly, evaluation metrics including task completion rate, formation maintenance rate, flight time, flight trajectory, and energy consumption are designed to evaluate the algorithm performance. Then, the multi-UAV collaborative path planning problem is modeled as a partially observable Markov decision process and a multi-agent soft actor critic algorithm is proposed to seek the approximate optimal strategy for the problem. Finally, the effectiveness and superiority of the proposed algorithm are demonstrated through simulations.

KW - multi-agent deep reinforcement learning

KW - multi-agent soft actor critic algorithm

KW - multi-UAV

KW - partially observable Markov decision process

KW - path planning

UR - http://www.scopus.com/inward/record.url?scp=85202673073&partnerID=8YFLogxK

U2 - 10.1360/SSI-2024-0050

DO - 10.1360/SSI-2024-0050

M3 - Article

AN - SCOPUS:85202673073

SN - 1674-7267

VL - 54

SP - 1871

EP - 1883

JO - Scientia Sinica Informationis

JF - Scientia Sinica Informationis

IS - 8

ER -
