TY - GEN
T1 - Mimicking Albatross Flight Strategies
T2 - 2nd Aerospace Frontiers Conference, AFC 2025
AU - Wang, Wei
AU - An, Weigang
AU - Song, Bifeng
AU - Yang, Wenqing
AU - Luo, Yang
N1 - Publisher Copyright:
© Press of Acta Aeronautica et Astronautica Sinica 2026.
PY - 2026
Y1 - 2026
N2 - Dynamic soaring is an energy-efficient flight strategy employed by albatrosses to traverse vast oceanic distances by exploiting wind gradients. Inspired by this natural phenomenon, this study proposes a deep reinforcement learning (DRL) framework for real-time trajectory planning of unmanned aerial vehicles (UAVs) performing dynamic soaring. The proposed method enables the UAV to autonomously plan and execute dynamic soaring trajectories, optimizing energy harvesting from the wind while meeting mission-specific requirements. Using the Deep Deterministic Policy Gradient (DDPG) algorithm, the trained agent successfully generates dynamic soaring paths that replicate the Rayleigh loop behavior, effectively harvesting energy while maintaining energy neutrality during flight. The study further identifies an interesting phenomenon where the agent plans windward climb trajectories and successfully harvests energy, which deviates from traditional dynamic soaring models. This discovery highlights the significance of the sideslip angle in energy harvesting, expanding the applicability of conventional dynamic soaring energy mechanisms. The results provide valuable insights into improving UAV dynamic soaring strategies and advancing reinforcement learning applications in energy-efficient autonomous flight.
AB - Dynamic soaring is an energy-efficient flight strategy employed by albatrosses to traverse vast oceanic distances by exploiting wind gradients. Inspired by this natural phenomenon, this study proposes a deep reinforcement learning (DRL) framework for real-time trajectory planning of unmanned aerial vehicles (UAVs) performing dynamic soaring. The proposed method enables the UAV to autonomously plan and execute dynamic soaring trajectories, optimizing energy harvesting from the wind while meeting mission-specific requirements. Using the Deep Deterministic Policy Gradient (DDPG) algorithm, the trained agent successfully generates dynamic soaring paths that replicate the Rayleigh loop behavior, effectively harvesting energy while maintaining energy neutrality during flight. The study further identifies an interesting phenomenon where the agent plans windward climb trajectories and successfully harvests energy, which deviates from traditional dynamic soaring models. This discovery highlights the significance of the sideslip angle in energy harvesting, expanding the applicability of conventional dynamic soaring energy mechanisms. The results provide valuable insights into improving UAV dynamic soaring strategies and advancing reinforcement learning applications in energy-efficient autonomous flight.
KW - Deep Reinforcement Learning
KW - Dynamic Soaring
KW - Path Planning
UR - https://www.scopus.com/pages/publications/105022870873
U2 - 10.1007/978-981-95-3016-8_18
DO - 10.1007/978-981-95-3016-8_18
M3 - 会议稿件
AN - SCOPUS:105022870873
SN - 9789819530151
T3 - Lecture Notes in Mechanical Engineering
SP - 260
EP - 275
BT - Proceedings of the 2nd Aerospace Frontiers Conference, AFC 2025 - Volume IV
PB - Springer Science and Business Media Deutschland GmbH
Y2 - 11 April 2025 through 14 April 2025
ER -