Explainable Deep Reinforcement Learning for UAV autonomous path planning

Lei He; Nabil Aouf; Bifeng Song

doi:10.1016/j.ast.2021.107052

Explainable Deep Reinforcement Learning for UAV autonomous path planning

Lei He, Nabil Aouf, Bifeng Song

School of Aeronautics

Research output: Contribution to journal › Article › peer-review

135 Scopus citations

Abstract

Autonomous navigation in unknown environment is still a hard problem for small Unmanned Aerial Vehicles (UAVs). Recently, some neural network-based methods are proposed to tackle this problem, however, the trained network is opaque, non-intuitive and difficult for people to understand, which limits the real-world application. In this paper, a novel explainable deep neural network-based path planner is proposed for quadrotor to fly autonomously in unknown environment. The navigation problem is modelled as a Markov Decision Process (MDP) and the path planner is trained using Deep Reinforcement Learning (DRL) method in simulation environment. To get better understanding of the trained model, a novel model explanation method is proposed based on the feature attribution. Some easy-to-interpret textual and visual explanations are generated to allow end-users to understand what triggered a particular behaviour. Moreover, some global analyses are provided for experts to evaluate and improve the trained network. Finally, real-world flight tests are conducted to illustrate that our path planner trained in the simulation is robust enough to be applied in the real environment directly.

Original language	English
Article number	107052
Journal	Aerospace Science and Technology
Volume	118
DOIs	https://doi.org/10.1016/j.ast.2021.107052
State	Published - Nov 2021

Keywords

Autonomous navigation
Deep Reinforcement Learning (DRL)
Explainable AI
Unmanned Aerial Vehicles (UAVs)

Access to Document

10.1016/j.ast.2021.107052

Cite this

@article{44eb8f9794fc4cb0b3af71a256ba76a2,

title = "Explainable Deep Reinforcement Learning for UAV autonomous path planning",

abstract = "Autonomous navigation in unknown environment is still a hard problem for small Unmanned Aerial Vehicles (UAVs). Recently, some neural network-based methods are proposed to tackle this problem, however, the trained network is opaque, non-intuitive and difficult for people to understand, which limits the real-world application. In this paper, a novel explainable deep neural network-based path planner is proposed for quadrotor to fly autonomously in unknown environment. The navigation problem is modelled as a Markov Decision Process (MDP) and the path planner is trained using Deep Reinforcement Learning (DRL) method in simulation environment. To get better understanding of the trained model, a novel model explanation method is proposed based on the feature attribution. Some easy-to-interpret textual and visual explanations are generated to allow end-users to understand what triggered a particular behaviour. Moreover, some global analyses are provided for experts to evaluate and improve the trained network. Finally, real-world flight tests are conducted to illustrate that our path planner trained in the simulation is robust enough to be applied in the real environment directly.",

keywords = "Autonomous navigation, Deep Reinforcement Learning (DRL), Explainable AI, Unmanned Aerial Vehicles (UAVs)",

author = "Lei He and Nabil Aouf and Bifeng Song",

note = "Publisher Copyright: {\textcopyright} 2021 Elsevier Masson SAS",

year = "2021",

month = nov,

doi = "10.1016/j.ast.2021.107052",

language = "英语",

volume = "118",

journal = "Aerospace Science and Technology",

issn = "1270-9638",

publisher = "Elsevier Masson s.r.l.",

}

TY - JOUR

T1 - Explainable Deep Reinforcement Learning for UAV autonomous path planning

AU - He, Lei

AU - Aouf, Nabil

AU - Song, Bifeng

PY - 2021/11

Y1 - 2021/11

N2 - Autonomous navigation in unknown environment is still a hard problem for small Unmanned Aerial Vehicles (UAVs). Recently, some neural network-based methods are proposed to tackle this problem, however, the trained network is opaque, non-intuitive and difficult for people to understand, which limits the real-world application. In this paper, a novel explainable deep neural network-based path planner is proposed for quadrotor to fly autonomously in unknown environment. The navigation problem is modelled as a Markov Decision Process (MDP) and the path planner is trained using Deep Reinforcement Learning (DRL) method in simulation environment. To get better understanding of the trained model, a novel model explanation method is proposed based on the feature attribution. Some easy-to-interpret textual and visual explanations are generated to allow end-users to understand what triggered a particular behaviour. Moreover, some global analyses are provided for experts to evaluate and improve the trained network. Finally, real-world flight tests are conducted to illustrate that our path planner trained in the simulation is robust enough to be applied in the real environment directly.

AB - Autonomous navigation in unknown environment is still a hard problem for small Unmanned Aerial Vehicles (UAVs). Recently, some neural network-based methods are proposed to tackle this problem, however, the trained network is opaque, non-intuitive and difficult for people to understand, which limits the real-world application. In this paper, a novel explainable deep neural network-based path planner is proposed for quadrotor to fly autonomously in unknown environment. The navigation problem is modelled as a Markov Decision Process (MDP) and the path planner is trained using Deep Reinforcement Learning (DRL) method in simulation environment. To get better understanding of the trained model, a novel model explanation method is proposed based on the feature attribution. Some easy-to-interpret textual and visual explanations are generated to allow end-users to understand what triggered a particular behaviour. Moreover, some global analyses are provided for experts to evaluate and improve the trained network. Finally, real-world flight tests are conducted to illustrate that our path planner trained in the simulation is robust enough to be applied in the real environment directly.

KW - Autonomous navigation

KW - Deep Reinforcement Learning (DRL)

KW - Explainable AI

KW - Unmanned Aerial Vehicles (UAVs)

UR - http://www.scopus.com/inward/record.url?scp=85114628440&partnerID=8YFLogxK

U2 - 10.1016/j.ast.2021.107052

DO - 10.1016/j.ast.2021.107052

M3 - 文章

AN - SCOPUS:85114628440

SN - 1270-9638

VL - 118

JO - Aerospace Science and Technology

JF - Aerospace Science and Technology

M1 - 107052

ER -

Explainable Deep Reinforcement Learning for UAV autonomous path planning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this