TY - JOUR
T1 - UAV autonomous navigation with hybrid maneuver modes
T2 - A hierarchical reinforcement learning method
AU - Kou, Kai
AU - Yang, Gang
AU - Zhang, Wenqi
AU - Yao, Yuan
AU - Zhou, Xingshe
N1 - Publisher Copyright:
© The Author(s) 2026
PY - 2026
Y1 - 2026
N2 - Reinforcement learning has achieved significant progress in UAV autonomous navigation. However, existing methods typically rely on either purely discrete or purely continuous action spaces to define the UAV’s maneuver mode. Discrete action space is simple to implement and converge quickly but lack sufficient control granularity. In contrast, continuous action space provides higher control resolution but often leads to inefficient training and susceptibility to local optima. Existing RL methods cannot adaptively switch maneuver modes within a unified framework because discrete and continuous action spaces differ fundamentally in structure and control objectives. To address this issue, we propose a hierarchical reinforcement learning framework with hybrid action space (HAS-HRL). Specifically, the high-level policy adaptively selects the maneuver mode according to the environment context, while the low-level policy consists of a set of primitive navigation skills associated with the hybrid maneuver modes. These skills generate executable control commands, enabling the UAV to perform smooth maneuvers in dense obstacle regions while cruising efficiently in open spaces. Furthermore, an event-triggered control rule is introduced to provide structured prior guidance during the early training stage, thereby improving exploration efficiency and convergence stability. Experiments in various simulation environments demonstrate that the proposed HAS-HRL framework consistently outperforms single-layer RL and HRL baselines in terms of success rate, obstacle-avoidance performance, and training stability. The results show that the hybrid maneuver modes effectively balance flight safety and navigation efficiency, offering a robust and efficient solution for UAV autonomous navigation in complex scenarios.
AB - Reinforcement learning has achieved significant progress in UAV autonomous navigation. However, existing methods typically rely on either purely discrete or purely continuous action spaces to define the UAV’s maneuver mode. Discrete action space is simple to implement and converge quickly but lack sufficient control granularity. In contrast, continuous action space provides higher control resolution but often leads to inefficient training and susceptibility to local optima. Existing RL methods cannot adaptively switch maneuver modes within a unified framework because discrete and continuous action spaces differ fundamentally in structure and control objectives. To address this issue, we propose a hierarchical reinforcement learning framework with hybrid action space (HAS-HRL). Specifically, the high-level policy adaptively selects the maneuver mode according to the environment context, while the low-level policy consists of a set of primitive navigation skills associated with the hybrid maneuver modes. These skills generate executable control commands, enabling the UAV to perform smooth maneuvers in dense obstacle regions while cruising efficiently in open spaces. Furthermore, an event-triggered control rule is introduced to provide structured prior guidance during the early training stage, thereby improving exploration efficiency and convergence stability. Experiments in various simulation environments demonstrate that the proposed HAS-HRL framework consistently outperforms single-layer RL and HRL baselines in terms of success rate, obstacle-avoidance performance, and training stability. The results show that the hybrid maneuver modes effectively balance flight safety and navigation efficiency, offering a robust and efficient solution for UAV autonomous navigation in complex scenarios.
KW - event-triggered control rule
KW - hierarchical reinforcement learning
KW - hybrid action space
KW - hybrid maneuver modes
KW - UAV autonomous navigation
UR - https://www.scopus.com/pages/publications/105037093279
U2 - 10.1177/1088467X251408949
DO - 10.1177/1088467X251408949
M3 - 文章
AN - SCOPUS:105037093279
SN - 1088-467X
JO - Intelligent Data Analysis
JF - Intelligent Data Analysis
ER -