Cooperative location of multiple UAVs with deep reinforcement learning in GPS-denied environment

Kaifang Wan; Zhilin Wu; Yunhui Wu; Haozhi Qiang; Yibo Wu; Bo Li

doi:10.7527/S1000-6893.2024.31024

School of Electronics and Information

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

Abstract

In strongly adversarial scenarios, Unmanned Aerial Vehicles (UAVs) often lose GPS because of interference, which makes it difficult to obtain their accurate position. Since UAVs often operate in formations or clusters, this paper proposes a strategy in which UAVs within the formation measure relative spatial positions and locate each other, allowing UAVs to update their position information in real time even after GPS signal loss. First, in response to the GPS-denied environment, the theory of the Partially Observable Markov Decision Process (POMDP) is introduced and the elements of the POMDP are analyzed to establish a POMDP decision model based on collaborative positioning and scheduling. A belief state update method based on the Extended Kalman Filter (EKF) and a Q-value estimation method based on the Deep Q-Network (DQN) in deep reinforcement learning are then proposed to achieve accurate collaborative real-time positioning. Application tests in different scenarios show that the proposed model can efficiently manage and schedule UAVs in formation, and can direct UAVs with working GPS to cooperatively locate UAVs whose GPS has failed, which verifies the effectiveness of the model.
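The abstract does not give the measurement model or the filter equations, so the following is only a minimal sketch of what an EKF belief update could look like in this setting. It assumes a 2D position state for the GPS-failed UAV, a single scalar range measurement from one GPS-normal UAV at a known position, and known noise variance. The function name `ekf_range_update` and all parameters are hypothetical and are not taken from the paper.

```python
import math


def ekf_range_update(x, P, anchor, z, r):
    """One EKF measurement update for a 2D position belief.

    Assumptions (not from the paper): the state is [px, py], the
    measurement is the Euclidean range to a UAV at the known position
    `anchor`, and `r` is the range-noise variance.
    """
    dx, dy = x[0] - anchor[0], x[1] - anchor[1]
    d = math.hypot(dx, dy)          # predicted range h(x)
    if d < 1e-9:
        return list(x), [row[:] for row in P]   # Jacobian undefined at zero range

    H = (dx / d, dy / d)            # Jacobian of the range w.r.t. [px, py]

    # P H^T (2x1), innovation variance S (scalar), Kalman gain K (2x1)
    PHt = (P[0][0] * H[0] + P[0][1] * H[1],
           P[1][0] * H[0] + P[1][1] * H[1])
    S = H[0] * PHt[0] + H[1] * PHt[1] + r
    K = (PHt[0] / S, PHt[1] / S)

    innovation = z - d
    x_new = [x[0] + K[0] * innovation, x[1] + K[1] * innovation]

    # Covariance update: P_new = (I - K H) P
    I_KH = [[1 - K[0] * H[0], -K[0] * H[1]],
            [-K[1] * H[0], 1 - K[1] * H[1]]]
    P_new = [[sum(I_KH[i][k] * P[k][j] for k in range(2)) for j in range(2)]
             for i in range(2)]
    return x_new, P_new


if __name__ == "__main__":
    # Illustrative numbers only: prior belief at (10, 0), measured range 12.
    x, P = ekf_range_update([10.0, 0.0], [[4.0, 0.0], [0.0, 4.0]],
                            (0.0, 0.0), z=12.0, r=1.0)
    print(x, P)
```

In a POMDP formulation of this kind, the updated mean and covariance would serve as the belief state on which a scheduler (in the paper, a DQN) decides which GPS-normal UAV takes the next measurement; that decision step is not shown here.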

Translated title of the contribution	Cooperative location of multiple UAVs with deep reinforcement learning in GPS-denied environment
Original language	Traditional Chinese
Article number	331024
Journal	Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
Volume	46
Issue number	8
DOI	https://doi.org/10.7527/S1000-6893.2024.31024
Publication status	Published - 25 Apr 2025

Keywords

collaborative positioning
deep reinforcement learning
GPS-denied
Markov decision
multiple UAVs

Access to Document

10.7527/S1000-6893.2024.31024

Other files and links

Link to publication in Scopus

Cite this

@article{553b8af62fc0410f9fdba959c5e37238,
title = "拒止环境下基于深度强化学习的多无人机协同定位",
abstract = "In strongly adversarial scenarios, Unmanned Aerial Vehicles (UAVs) often lose GPS because of interference, which makes it difficult to obtain their accurate position. Since UAVs often operate in formations or clusters, this paper proposes a strategy in which UAVs within the formation measure relative spatial positions and locate each other, allowing UAVs to update their position information in real time even after GPS signal loss. First, in response to the GPS-denied environment, the theory of the Partially Observable Markov Decision Process (POMDP) is introduced and the elements of the POMDP are analyzed to establish a POMDP decision model based on collaborative positioning and scheduling. A belief state update method based on the Extended Kalman Filter (EKF) and a Q-value estimation method based on the Deep Q-Network (DQN) in deep reinforcement learning are then proposed to achieve accurate collaborative real-time positioning. Application tests in different scenarios show that the proposed model can efficiently manage and schedule UAVs in formation, and can direct UAVs with working GPS to cooperatively locate UAVs whose GPS has failed, which verifies the effectiveness of the model.",
keywords = "collaborative positioning, deep reinforcement learning, GPS-denied, Markov decision, multiple UAVs",
author = "Kaifang Wan and Zhilin Wu and Yunhui Wu and Haozhi Qiang and Yibo Wu and Bo Li",
year = "2025",
month = apr,
day = "25",
doi = "10.7527/S1000-6893.2024.31024",
language = "Traditional Chinese",
volume = "46",
journal = "Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica",
issn = "1000-6893",
publisher = "AAAS Press of Chinese Society of Aeronautics and Astronautics",
number = "8",
}

TY  - JOUR
T1  - 拒止环境下基于深度强化学习的多无人机协同定位
AU  - Wan, Kaifang
AU  - Wu, Zhilin
AU  - Wu, Yunhui
AU  - Qiang, Haozhi
AU  - Wu, Yibo
AU  - Li, Bo
PY  - 2025/4/25
Y1  - 2025/4/25
N2  - In strongly adversarial scenarios, Unmanned Aerial Vehicles (UAVs) often lose GPS because of interference, which makes it difficult to obtain their accurate position. Since UAVs often operate in formations or clusters, this paper proposes a strategy in which UAVs within the formation measure relative spatial positions and locate each other, allowing UAVs to update their position information in real time even after GPS signal loss. First, in response to the GPS-denied environment, the theory of the Partially Observable Markov Decision Process (POMDP) is introduced and the elements of the POMDP are analyzed to establish a POMDP decision model based on collaborative positioning and scheduling. A belief state update method based on the Extended Kalman Filter (EKF) and a Q-value estimation method based on the Deep Q-Network (DQN) in deep reinforcement learning are then proposed to achieve accurate collaborative real-time positioning. Application tests in different scenarios show that the proposed model can efficiently manage and schedule UAVs in formation, and can direct UAVs with working GPS to cooperatively locate UAVs whose GPS has failed, which verifies the effectiveness of the model.
AB  - In strongly adversarial scenarios, Unmanned Aerial Vehicles (UAVs) often lose GPS because of interference, which makes it difficult to obtain their accurate position. Since UAVs often operate in formations or clusters, this paper proposes a strategy in which UAVs within the formation measure relative spatial positions and locate each other, allowing UAVs to update their position information in real time even after GPS signal loss. First, in response to the GPS-denied environment, the theory of the Partially Observable Markov Decision Process (POMDP) is introduced and the elements of the POMDP are analyzed to establish a POMDP decision model based on collaborative positioning and scheduling. A belief state update method based on the Extended Kalman Filter (EKF) and a Q-value estimation method based on the Deep Q-Network (DQN) in deep reinforcement learning are then proposed to achieve accurate collaborative real-time positioning. Application tests in different scenarios show that the proposed model can efficiently manage and schedule UAVs in formation, and can direct UAVs with working GPS to cooperatively locate UAVs whose GPS has failed, which verifies the effectiveness of the model.
KW  - collaborative positioning
KW  - deep reinforcement learning
KW  - GPS-denied
KW  - Markov decision
KW  - multiple UAVs
UR  - http://www.scopus.com/inward/record.url?scp=105006720752&partnerID=8YFLogxK
U2  - 10.7527/S1000-6893.2024.31024
DO  - 10.7527/S1000-6893.2024.31024
M3  - Article
AN  - SCOPUS:105006720752
SN  - 1000-6893
VL  - 46
JO  - Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
JF  - Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica
IS  - 8
M1  - 331024
ER  -
