TY - JOUR
T1 - A RDA-Based Deep Reinforcement Learning Approach for Autonomous Motion Planning of UAV in Dynamic Unknown Environments
AU - Wan, Kaifang
AU - Gao, Xiaoguang
AU - Hu, Zijian
AU - Zhang, Wei
N1 - Publisher Copyright:
© 2020 IOP Publishing Ltd. All rights reserved.
PY - 2020/4/8
Y1 - 2020/4/8
N2 - Autonomous motion planning (AMP) in dynamic unknown environments emerges as an urgent requirement with the prosperity of unmanned aerial vehicle (UAV). In this paper, we present a DRL-based planning framework to address the AMP problem, which is applicable in both military and civilian fields. To maintain learning efficiency, a novel reward difference amplifying (RDA) scheme is proposed to reshape the conventional reward functions and is introduced into state-of-the-art DRLs to constructs novel DRL algorithms for the planner's learning. Different from conventional motion planning approaches, our DRL-based methods provide an end-to-end control for UAV, which directly maps the raw sensory measurements into high-level control signals. The training and testing experiments demonstrate that our RDA scheme makes great contributions to the performance improvement and provides the UAV good adaptability to dynamic environments.
AB - Autonomous motion planning (AMP) in dynamic unknown environments emerges as an urgent requirement with the prosperity of unmanned aerial vehicle (UAV). In this paper, we present a DRL-based planning framework to address the AMP problem, which is applicable in both military and civilian fields. To maintain learning efficiency, a novel reward difference amplifying (RDA) scheme is proposed to reshape the conventional reward functions and is introduced into state-of-the-art DRLs to constructs novel DRL algorithms for the planner's learning. Different from conventional motion planning approaches, our DRL-based methods provide an end-to-end control for UAV, which directly maps the raw sensory measurements into high-level control signals. The training and testing experiments demonstrate that our RDA scheme makes great contributions to the performance improvement and provides the UAV good adaptability to dynamic environments.
UR - http://www.scopus.com/inward/record.url?scp=85083495026&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/1487/1/012006
DO - 10.1088/1742-6596/1487/1/012006
M3 - 会议文章
AN - SCOPUS:85083495026
SN - 1742-6588
VL - 1487
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 1
M1 - 012006
T2 - 2020 4th International Conference on Control Engineering and Artificial Intelligence, CCEAI 2020
Y2 - 17 January 2020 through 19 January 2020
ER -