TY - JOUR
T1 - A CCO–PPO Framework for Autonomous UAV Trajectory Tracking in Complex and Disturbed Environments
AU - Guo, Xize
AU - Fan, Chao
AU - Shao, Boxuan
AU - Deng, Qi
AU - Chen, Jiahao
AU - Zhang, Tao
AU - Zhang, Wentao
N1 - Publisher Copyright:
© 2026 by the authors.
PY - 2026/5
Y1 - 2026/5
N2 - Accurate trajectory tracking is fundamental to the autonomous operation of unmanned aerial vehicles (UAVs) in complex tasks. While proximal policy optimization (PPO) has shown strong potential in UAV control, its performance is highly sensitive to hyperparameter configuration, and manual tuning is time-consuming due to complex interparameter coupling. This paper proposes CCO–PPO, a framework integrating the cuckoo catfish optimizer (CCO) with PPO for automatic hyperparameter optimization in UAV trajectory tracking. The problem is formulated as a Markov decision process with a 20-dimensional state space, and the CCO performs offline search over a four-dimensional hyperparameter space. Evaluated across seven test environments covering diverse trajectory geometries, wind disturbances, sensor noise, and large-scale scenarios, CCO–PPO achieves the lowest tracking error in all cases. Performance gains over baseline PPO increase monotonically with task complexity, reaching 18.8% under combined wind disturbance and sensor noise, with statistically significant advantages in 85.7% of pairwise comparisons against baseline PPO, SAC, and TD3. Ablation studies confirm that joint optimization of all four hyperparameters is essential under high-disturbance conditions, and comparisons with Bayesian optimization validate the CCO’s superior cross-seed stability. These results demonstrate that metaheuristic hyperparameter optimization substantially enhances policy robustness in high-disturbance UAV trajectory tracking scenarios.
AB - Accurate trajectory tracking is fundamental to the autonomous operation of unmanned aerial vehicles (UAVs) in complex tasks. While proximal policy optimization (PPO) has shown strong potential in UAV control, its performance is highly sensitive to hyperparameter configuration, and manual tuning is time-consuming due to complex interparameter coupling. This paper proposes CCO–PPO, a framework integrating the cuckoo catfish optimizer (CCO) with PPO for automatic hyperparameter optimization in UAV trajectory tracking. The problem is formulated as a Markov decision process with a 20-dimensional state space, and the CCO performs offline search over a four-dimensional hyperparameter space. Evaluated across seven test environments covering diverse trajectory geometries, wind disturbances, sensor noise, and large-scale scenarios, CCO–PPO achieves the lowest tracking error in all cases. Performance gains over baseline PPO increase monotonically with task complexity, reaching 18.8% under combined wind disturbance and sensor noise, with statistically significant advantages in 85.7% of pairwise comparisons against baseline PPO, SAC, and TD3. Ablation studies confirm that joint optimization of all four hyperparameters is essential under high-disturbance conditions, and comparisons with Bayesian optimization validate the CCO’s superior cross-seed stability. These results demonstrate that metaheuristic hyperparameter optimization substantially enhances policy robustness in high-disturbance UAV trajectory tracking scenarios.
KW - cuckoo catfish optimizer
KW - hyperparameter optimization
KW - metaheuristic algorithm
KW - proximal policy optimization
KW - reinforcement learning
KW - trajectory tracking
KW - unmanned aerial vehicle
UR - https://www.scopus.com/pages/publications/105038591288
U2 - 10.3390/s26092735
DO - 10.3390/s26092735
M3 - 文章
C2 - 42122457
AN - SCOPUS:105038591288
SN - 1424-8220
VL - 26
JO - Sensors
JF - Sensors
IS - 9
M1 - 2735
ER -