Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Bo Li; Zhi peng Yang; Da qing Chen; Shi yang Liang; Hao Ma

doi:10.1016/j.dt.2020.11.014

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Bo Li, Zhi peng Yang, Da qing Chen, Shi yang Liang, Hao Ma

电子信息学院

科研成果: 期刊稿件 › 文章 › 同行评审

113 引用（Scopus）

摘要

Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

源语言	英语
页（从-至）	457-466
页数	10
期刊	Defence Technology
卷	17
期	2
DOI	https://doi.org/10.1016/j.dt.2020.11.014
出版状态	已出版 - 4月 2021

访问文件

10.1016/j.dt.2020.11.014

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{a36c619a5ac048e0950188177bfbdd8f,

title = "Maneuvering target tracking of UAV based on MN-DDPG and transfer learning",

abstract = "Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.",

keywords = "Deep reinforcement learning, MN-DDPG, Maneuvering target tracking, Mixed noises, Transfer learning, UAVs",

author = "Bo Li and Yang, {Zhi peng} and Chen, {Da qing} and Liang, {Shi yang} and Hao Ma",

note = "Publisher Copyright: {\textcopyright} 2020 The Authors",

year = "2021",

month = apr,

doi = "10.1016/j.dt.2020.11.014",

language = "英语",

volume = "17",

pages = "457--466",

journal = "Defence Technology",

issn = "2096-3459",

publisher = "KeAi Communications Co.",

number = "2",

}

TY - JOUR

T1 - Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

AU - Li, Bo

AU - Yang, Zhi peng

AU - Chen, Da qing

AU - Liang, Shi yang

AU - Ma, Hao

PY - 2021/4

Y1 - 2021/4

N2 - Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

AB - Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

KW - Deep reinforcement learning

KW - MN-DDPG

KW - Maneuvering target tracking

KW - Mixed noises

KW - Transfer learning

KW - UAVs

UR - http://www.scopus.com/inward/record.url?scp=85097463865&partnerID=8YFLogxK

U2 - 10.1016/j.dt.2020.11.014

DO - 10.1016/j.dt.2020.11.014

M3 - 文章

AN - SCOPUS:85097463865

SN - 2096-3459

VL - 17

SP - 457

EP - 466

JO - Defence Technology

JF - Defence Technology

IS - 2

ER -

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

摘要

访问文件

其它文件与链接

指纹

引用此