A RDA-Based Deep Reinforcement Learning Approach for Autonomous Motion Planning of UAV in Dynamic Unknown Environments

Kaifang Wan; Xiaoguang Gao; Zijian Hu; Wei Zhang

doi:10.1088/1742-6596/1487/1/012006

School of Electronics and Information

Research output: Contribution to journal › Conference article › peer-review

2 Scopus citations

Abstract

Autonomous motion planning (AMP) in dynamic unknown environments has become an urgent requirement with the proliferation of unmanned aerial vehicles (UAVs). In this paper, we present a deep reinforcement learning (DRL)-based planning framework to address the AMP problem, which is applicable in both military and civilian fields. To maintain learning efficiency, a novel reward difference amplifying (RDA) scheme is proposed to reshape conventional reward functions and is introduced into state-of-the-art DRL algorithms to construct novel DRL algorithms for the planner's learning. Unlike conventional motion planning approaches, our DRL-based methods provide end-to-end control for the UAV, directly mapping raw sensory measurements to high-level control signals. The training and testing experiments demonstrate that the RDA scheme contributes substantially to performance improvement and gives the UAV good adaptability to dynamic environments.

Original language	English
Article number	012006
Journal	Journal of Physics: Conference Series
Volume	1487
Issue number	1
DOIs	https://doi.org/10.1088/1742-6596/1487/1/012006
State	Published - 8 Apr 2020
Event	2020 4th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2020 - Singapore, Singapore
Duration	17 Jan 2020 → 19 Jan 2020

Cite this

@article{0f69f34af01345ffa245d3adeb9d1f50,
title = "A RDA-Based Deep Reinforcement Learning Approach for Autonomous Motion Planning of UAV in Dynamic Unknown Environments",
abstract = "Autonomous motion planning (AMP) in dynamic unknown environments has become an urgent requirement with the proliferation of unmanned aerial vehicles (UAVs). In this paper, we present a deep reinforcement learning (DRL)-based planning framework to address the AMP problem, which is applicable in both military and civilian fields. To maintain learning efficiency, a novel reward difference amplifying (RDA) scheme is proposed to reshape conventional reward functions and is introduced into state-of-the-art DRL algorithms to construct novel DRL algorithms for the planner's learning. Unlike conventional motion planning approaches, our DRL-based methods provide end-to-end control for the UAV, directly mapping raw sensory measurements to high-level control signals. The training and testing experiments demonstrate that the RDA scheme contributes substantially to performance improvement and gives the UAV good adaptability to dynamic environments.",
author = "Kaifang Wan and Xiaoguang Gao and Zijian Hu and Wei Zhang",
note = "Publisher Copyright: {\textcopyright} 2020 IOP Publishing Ltd. All rights reserved.; 2020 4th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2020 ; Conference date: 17-01-2020 Through 19-01-2020",
year = "2020",
month = apr,
day = "8",
doi = "10.1088/1742-6596/1487/1/012006",
language = "English",
volume = "1487",
journal = "Journal of Physics: Conference Series",
issn = "1742-6588",
publisher = "IOP Publishing Ltd.",
number = "1",
}

TY  - JOUR
T1  - A RDA-Based Deep Reinforcement Learning Approach for Autonomous Motion Planning of UAV in Dynamic Unknown Environments
AU  - Wan, Kaifang
AU  - Gao, Xiaoguang
AU  - Hu, Zijian
AU  - Zhang, Wei
PY  - 2020/4/8
Y1  - 2020/4/8
AB  - Autonomous motion planning (AMP) in dynamic unknown environments has become an urgent requirement with the proliferation of unmanned aerial vehicles (UAVs). In this paper, we present a deep reinforcement learning (DRL)-based planning framework to address the AMP problem, which is applicable in both military and civilian fields. To maintain learning efficiency, a novel reward difference amplifying (RDA) scheme is proposed to reshape conventional reward functions and is introduced into state-of-the-art DRL algorithms to construct novel DRL algorithms for the planner's learning. Unlike conventional motion planning approaches, our DRL-based methods provide end-to-end control for the UAV, directly mapping raw sensory measurements to high-level control signals. The training and testing experiments demonstrate that the RDA scheme contributes substantially to performance improvement and gives the UAV good adaptability to dynamic environments.
UR  - http://www.scopus.com/inward/record.url?scp=85083495026&partnerID=8YFLogxK
U2  - 10.1088/1742-6596/1487/1/012006
DO  - 10.1088/1742-6596/1487/1/012006
M3  - Conference article
AN  - SCOPUS:85083495026
SN  - 1742-6588
VL  - 1487
JO  - Journal of Physics: Conference Series
JF  - Journal of Physics: Conference Series
IS  - 1
M1  - 012006
T2  - 2020 4th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2020
Y2  - 17 January 2020 through 19 January 2020
ER  -
