TY - JOUR
T1 - Event-Triggered Hierarchical Planner for Autonomous Navigation in Unknown Environment
AU - Chen, Changhao
AU - Song, Bifeng
AU - Fu, Qiang
AU - Xue, Dong
AU - He, Lei
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/12
Y1 - 2023/12
N2 - End-to-end deep neural network (DNN)-based motion planners have shown great potential in high-speed autonomous UAV flight. Yet, most existing methods only employ a single high-capacity DNN, which typically lacks generalization ability and suffers from high sample complexity. We propose a novel event-triggered hierarchical planner (ETHP), which exploits the bi-level optimization nature of the navigation task to achieve both efficient training and improved optimality. Specifically, we learn a depth-image-based end-to-end motion planner in a hierarchical reinforcement learning framework, where the high-level DNN is a reactive collision avoidance rerouter triggered by the clearance distance, and the low-level DNN is a goal-chaser that generates the heading and velocity references in real time. Our training considers the field-of-view constraint and explores the bi-level structural flexibility to promote the spatio–temporal optimality of planning. Moreover, we design simple yet effective rules to collect hindsight experience replay buffers, yielding more high-quality samples and faster convergence. The experiments show that, compared with a single-DNN baseline planner, ETHP significantly improves the success rate and generalizes better to the unseen environment.
AB - End-to-end deep neural network (DNN)-based motion planners have shown great potential in high-speed autonomous UAV flight. Yet, most existing methods only employ a single high-capacity DNN, which typically lacks generalization ability and suffers from high sample complexity. We propose a novel event-triggered hierarchical planner (ETHP), which exploits the bi-level optimization nature of the navigation task to achieve both efficient training and improved optimality. Specifically, we learn a depth-image-based end-to-end motion planner in a hierarchical reinforcement learning framework, where the high-level DNN is a reactive collision avoidance rerouter triggered by the clearance distance, and the low-level DNN is a goal-chaser that generates the heading and velocity references in real time. Our training considers the field-of-view constraint and explores the bi-level structural flexibility to promote the spatio–temporal optimality of planning. Moreover, we design simple yet effective rules to collect hindsight experience replay buffers, yielding more high-quality samples and faster convergence. The experiments show that, compared with a single-DNN baseline planner, ETHP significantly improves the success rate and generalizes better to the unseen environment.
KW - autonomous UAV flight
KW - collision avoidance
KW - event-triggered
KW - hierarchical reinforcement learning
KW - unknown environment
UR - http://www.scopus.com/inward/record.url?scp=85180492063&partnerID=8YFLogxK
U2 - 10.3390/drones7120690
DO - 10.3390/drones7120690
M3 - 文章
AN - SCOPUS:85180492063
SN - 2504-446X
VL - 7
JO - Drones
JF - Drones
IS - 12
M1 - 690
ER -