Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Bo Li; Zhi peng Yang; Da qing Chen; Shi yang Liang; Hao Ma

doi:10.1016/j.dt.2020.11.014

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Bo Li, Zhi peng Yang, Da qing Chen, Shi yang Liang, Hao Ma

School of Electronics and Information

Research output: Contribution to journal › Article › peer-review

114 Scopus citations

Abstract

Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

Original language	English
Pages (from-to)	457-466
Number of pages	10
Journal	Defence Technology
Volume	17
Issue number	2
DOIs	https://doi.org/10.1016/j.dt.2020.11.014
State	Published - Apr 2021

Keywords

Deep reinforcement learning
MN-DDPG
Maneuvering target tracking
Mixed noises
Transfer learning
UAVs

Access to Document

10.1016/j.dt.2020.11.014

Cite this

@article{a36c619a5ac048e0950188177bfbdd8f,

title = "Maneuvering target tracking of UAV based on MN-DDPG and transfer learning",

abstract = "Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.",

keywords = "Deep reinforcement learning, MN-DDPG, Maneuvering target tracking, Mixed noises, Transfer learning, UAVs",

author = "Bo Li and Yang, {Zhi peng} and Chen, {Da qing} and Liang, {Shi yang} and Hao Ma",

note = "Publisher Copyright: {\textcopyright} 2020 The Authors",

year = "2021",

month = apr,

doi = "10.1016/j.dt.2020.11.014",

language = "英语",

volume = "17",

pages = "457--466",

journal = "Defence Technology",

issn = "2096-3459",

publisher = "KeAi Communications Co.",

number = "2",

}

TY - JOUR

T1 - Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

AU - Li, Bo

AU - Yang, Zhi peng

AU - Chen, Da qing

AU - Liang, Shi yang

AU - Ma, Hao

PY - 2021/4

Y1 - 2021/4

N2 - Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

AB - Tracking maneuvering target in real time autonomously and accurately in an uncertain environment is one of the challenging missions for unmanned aerial vehicles (UAVs). In this paper, aiming to address the control problem of maneuvering target tracking and obstacle avoidance, an online path planning approach for UAV is developed based on deep reinforcement learning. Through end-to-end learning powered by neural networks, the proposed approach can achieve the perception of the environment and continuous motion output control. This proposed approach includes: (1) A deep deterministic policy gradient (DDPG)-based control framework to provide learning and autonomous decision-making capability for UAVs; (2) An improved method named MN-DDPG for introducing a type of mixed noises to assist UAV with exploring stochastic strategies for online optimal planning; and (3) An algorithm of task-decomposition and pre-training for efficient transfer learning to improve the generalization capability of UAV's control model built based on MN-DDPG. The experimental simulation results have verified that the proposed approach can achieve good self-adaptive adjustment of UAV's flight attitude in the tasks of maneuvering target tracking with a significant improvement in generalization capability and training efficiency of UAV tracking controller in uncertain environments.

KW - Deep reinforcement learning

KW - MN-DDPG

KW - Maneuvering target tracking

KW - Mixed noises

KW - Transfer learning

KW - UAVs

UR - http://www.scopus.com/inward/record.url?scp=85097463865&partnerID=8YFLogxK

U2 - 10.1016/j.dt.2020.11.014

DO - 10.1016/j.dt.2020.11.014

M3 - 文章

AN - SCOPUS:85097463865

SN - 2096-3459

VL - 17

SP - 457

EP - 466

JO - Defence Technology

JF - Defence Technology

IS - 2

ER -

Maneuvering target tracking of UAV based on MN-DDPG and transfer learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this