Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning

Kai Kou; Gang Yang; Wenqi Zhang; Chenyi Wang; Yuan Yao; Xingshe Zhou

doi:10.1109/ICARCE55724.2022.10046655

Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning

Kai Kou, Gang Yang, Wenqi Zhang, Chenyi Wang, Yuan Yao, Xingshe Zhou

School of Computer Science

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citations

Abstract

Autonomous navigation of unmanned aerial vehicle (UAV) is one of the fundamental yet completely solved problems in automatic control. In this paper, an option-based hierarchical reinforcement learning approach is proposed for UAV autonomous navigation. Specifically, the proposed method consists of a high-level and two low-level model, where the high level behavior selection model learns a stable and reliable behavior selection strategy automatically, while the low-level obstacle avoidance model and target-driven control model implement two behavior strategies, obstacle avoidance and target approach, respectively, thus avoiding the dependence on manually designed control rules. Furthermore, the proposed model is pre-trained on large public dataset, allowing the model to converge quickly in various complex unstructured flight environments. Extensive experiments show that the proposed method indicates an overall advantage in various evaluation metrics, which indicating that the proposed method has a strong generalization capability in autonomous navigation task of UAV.

Original language	English
Title of host publication	2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781665475488
DOIs	https://doi.org/10.1109/ICARCE55724.2022.10046655
State	Published - 2022
Event	2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022 - Virtual, Online, China Duration: 16 Dec 2022 → 17 Dec 2022

Publication series

Name	2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022

Conference

Conference	2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022
Country/Territory	China
City	Virtual, Online
Period	16/12/22 → 17/12/22

Keywords

Autonomous Navigation
Hierarchical Reinforcement Learning
Unmanned Aerial Vehicle (UAV)

Access to Document

10.1109/ICARCE55724.2022.10046655

Cite this

Kou, K., Yang, G., Zhang, W., Wang, C., Yao, Y., & Zhou, X. (2022). Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. In 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022 (2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICARCE55724.2022.10046655

Kou, Kai ; Yang, Gang ; Zhang, Wenqi et al. / Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022. Institute of Electrical and Electronics Engineers Inc., 2022. (2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022).

@inproceedings{03e27f75ea6648f09716975bb50d713a,

title = "Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning",

abstract = "Autonomous navigation of unmanned aerial vehicle (UAV) is one of the fundamental yet completely solved problems in automatic control. In this paper, an option-based hierarchical reinforcement learning approach is proposed for UAV autonomous navigation. Specifically, the proposed method consists of a high-level and two low-level model, where the high level behavior selection model learns a stable and reliable behavior selection strategy automatically, while the low-level obstacle avoidance model and target-driven control model implement two behavior strategies, obstacle avoidance and target approach, respectively, thus avoiding the dependence on manually designed control rules. Furthermore, the proposed model is pre-trained on large public dataset, allowing the model to converge quickly in various complex unstructured flight environments. Extensive experiments show that the proposed method indicates an overall advantage in various evaluation metrics, which indicating that the proposed method has a strong generalization capability in autonomous navigation task of UAV.",

keywords = "Autonomous Navigation, Hierarchical Reinforcement Learning, Unmanned Aerial Vehicle (UAV)",

author = "Kai Kou and Gang Yang and Wenqi Zhang and Chenyi Wang and Yuan Yao and Xingshe Zhou",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022 ; Conference date: 16-12-2022 Through 17-12-2022",

year = "2022",

doi = "10.1109/ICARCE55724.2022.10046655",

language = "英语",

series = "2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022",

}

Kou, K, Yang, G, Zhang, W, Wang, C, Yao, Y & Zhou, X 2022, Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. in 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022. 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022, Institute of Electrical and Electronics Engineers Inc., 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022, Virtual, Online, China, 16/12/22. https://doi.org/10.1109/ICARCE55724.2022.10046655

Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. / Kou, Kai; Yang, Gang; Zhang, Wenqi et al.
2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022. Institute of Electrical and Electronics Engineers Inc., 2022. (2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning

AU - Kou, Kai

AU - Yang, Gang

AU - Zhang, Wenqi

AU - Wang, Chenyi

AU - Yao, Yuan

AU - Zhou, Xingshe

PY - 2022

Y1 - 2022

N2 - Autonomous navigation of unmanned aerial vehicle (UAV) is one of the fundamental yet completely solved problems in automatic control. In this paper, an option-based hierarchical reinforcement learning approach is proposed for UAV autonomous navigation. Specifically, the proposed method consists of a high-level and two low-level model, where the high level behavior selection model learns a stable and reliable behavior selection strategy automatically, while the low-level obstacle avoidance model and target-driven control model implement two behavior strategies, obstacle avoidance and target approach, respectively, thus avoiding the dependence on manually designed control rules. Furthermore, the proposed model is pre-trained on large public dataset, allowing the model to converge quickly in various complex unstructured flight environments. Extensive experiments show that the proposed method indicates an overall advantage in various evaluation metrics, which indicating that the proposed method has a strong generalization capability in autonomous navigation task of UAV.

AB - Autonomous navigation of unmanned aerial vehicle (UAV) is one of the fundamental yet completely solved problems in automatic control. In this paper, an option-based hierarchical reinforcement learning approach is proposed for UAV autonomous navigation. Specifically, the proposed method consists of a high-level and two low-level model, where the high level behavior selection model learns a stable and reliable behavior selection strategy automatically, while the low-level obstacle avoidance model and target-driven control model implement two behavior strategies, obstacle avoidance and target approach, respectively, thus avoiding the dependence on manually designed control rules. Furthermore, the proposed model is pre-trained on large public dataset, allowing the model to converge quickly in various complex unstructured flight environments. Extensive experiments show that the proposed method indicates an overall advantage in various evaluation metrics, which indicating that the proposed method has a strong generalization capability in autonomous navigation task of UAV.

KW - Autonomous Navigation

KW - Hierarchical Reinforcement Learning

KW - Unmanned Aerial Vehicle (UAV)

UR - http://www.scopus.com/inward/record.url?scp=85149434618&partnerID=8YFLogxK

U2 - 10.1109/ICARCE55724.2022.10046655

DO - 10.1109/ICARCE55724.2022.10046655

M3 - 会议稿件

AN - SCOPUS:85149434618

T3 - 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022

BT - 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022

Y2 - 16 December 2022 through 17 December 2022

ER -

Kou K, Yang G, Zhang W, Wang C, Yao Y, Zhou X. Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning. In 2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022. Institute of Electrical and Electronics Engineers Inc. 2022. (2022 International Conference on Automation, Robotics and Computer Engineering, ICARCE 2022). doi: 10.1109/ICARCE55724.2022.10046655

Autonomous Navigation of UAV in Dynamic Unstructured Environments via Hierarchical Reinforcement Learning

Abstract

Publication series

Conference

Keywords

Access to Document

Other files and links

Fingerprint

Cite this