TY - JOUR
T1 - Explainable Deep Reinforcement Learning for UAV autonomous path planning
AU - He, Lei
AU - Aouf, Nabil
AU - Song, Bifeng
N1 - Publisher Copyright:
© 2021 Elsevier Masson SAS
PY - 2021/11
Y1 - 2021/11
N2 - Autonomous navigation in unknown environment is still a hard problem for small Unmanned Aerial Vehicles (UAVs). Recently, some neural network-based methods are proposed to tackle this problem, however, the trained network is opaque, non-intuitive and difficult for people to understand, which limits the real-world application. In this paper, a novel explainable deep neural network-based path planner is proposed for quadrotor to fly autonomously in unknown environment. The navigation problem is modelled as a Markov Decision Process (MDP) and the path planner is trained using Deep Reinforcement Learning (DRL) method in simulation environment. To get better understanding of the trained model, a novel model explanation method is proposed based on the feature attribution. Some easy-to-interpret textual and visual explanations are generated to allow end-users to understand what triggered a particular behaviour. Moreover, some global analyses are provided for experts to evaluate and improve the trained network. Finally, real-world flight tests are conducted to illustrate that our path planner trained in the simulation is robust enough to be applied in the real environment directly.
AB - Autonomous navigation in unknown environment is still a hard problem for small Unmanned Aerial Vehicles (UAVs). Recently, some neural network-based methods are proposed to tackle this problem, however, the trained network is opaque, non-intuitive and difficult for people to understand, which limits the real-world application. In this paper, a novel explainable deep neural network-based path planner is proposed for quadrotor to fly autonomously in unknown environment. The navigation problem is modelled as a Markov Decision Process (MDP) and the path planner is trained using Deep Reinforcement Learning (DRL) method in simulation environment. To get better understanding of the trained model, a novel model explanation method is proposed based on the feature attribution. Some easy-to-interpret textual and visual explanations are generated to allow end-users to understand what triggered a particular behaviour. Moreover, some global analyses are provided for experts to evaluate and improve the trained network. Finally, real-world flight tests are conducted to illustrate that our path planner trained in the simulation is robust enough to be applied in the real environment directly.
KW - Autonomous navigation
KW - Deep Reinforcement Learning (DRL)
KW - Explainable AI
KW - Unmanned Aerial Vehicles (UAVs)
UR - http://www.scopus.com/inward/record.url?scp=85114628440&partnerID=8YFLogxK
U2 - 10.1016/j.ast.2021.107052
DO - 10.1016/j.ast.2021.107052
M3 - 文章
AN - SCOPUS:85114628440
SN - 1270-9638
VL - 118
JO - Aerospace Science and Technology
JF - Aerospace Science and Technology
M1 - 107052
ER -