TY - GEN
T1 - Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning
AU - Kou, Kai
AU - Yang, Gang
AU - Zhang, Wenqi
AU - Wang, Chenyi
AU - Yao, Yuan
AU - Zhou, Xingshe
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - Autonomous navigation of unmanned aerial vehicle (UAV) is one of the fundamental yet completely solved problems in automatic control. In this paper, an option-based hierarchical reinforcement learning approach is proposed for UAV autonomous navigation. Specifically, the proposed method consists of a high-level and two low-level model, where the high level behavior selection model learns a stable and reliable behavior selection strategy automatically, while the low-level obstacle avoidance model and target-driven control model implement two behavior strategies, obstacle avoidance and target approach, respectively, thus avoiding the dependence on manually designed control rules. Furthermore, the proposed model is pre-trained on large public dataset, allowing the model to converge quickly in various complex unstructured flight environments. Extensive experiments show that the proposed method indicates an overall advantage in various evaluation metrics, which indicating that the proposed method has a strong generalization capability in autonomous navigation task of UAV.
AB - Autonomous navigation of unmanned aerial vehicle (UAV) is one of the fundamental yet completely solved problems in automatic control. In this paper, an option-based hierarchical reinforcement learning approach is proposed for UAV autonomous navigation. Specifically, the proposed method consists of a high-level and two low-level model, where the high level behavior selection model learns a stable and reliable behavior selection strategy automatically, while the low-level obstacle avoidance model and target-driven control model implement two behavior strategies, obstacle avoidance and target approach, respectively, thus avoiding the dependence on manually designed control rules. Furthermore, the proposed model is pre-trained on large public dataset, allowing the model to converge quickly in various complex unstructured flight environments. Extensive experiments show that the proposed method indicates an overall advantage in various evaluation metrics, which indicating that the proposed method has a strong generalization capability in autonomous navigation task of UAV.
KW - Autonomous Navigation
KW - Hierarchical Reinforcement Learning
KW - Unmanned Aerial Vehicle (UAV)
UR - http://www.scopus.com/inward/record.url?scp=85149434618&partnerID=8YFLogxK
U2 - 10.1109/ICARCE55724.2022.10046655
DO - 10.1109/ICARCE55724.2022.10046655
M3 - 会议稿件
AN - SCOPUS:85149434618
T3 - 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022
BT - 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022
Y2 - 16 December 2022 through 17 December 2022
ER -