跳到主要导航 跳到搜索 跳到主要内容

Lightweight Obstacle Avoidance for Fixed-Wing UAVs Using Entropy-Aware PPO

  • Northwestern Polytechnical University Xian

科研成果: 期刊稿件文章同行评审

3 引用 (Scopus)

摘要

Obstacle avoidance during high-speed, low-altitude flight remains a significant challenge for unmanned aerial vehicles (UAVs), particularly in unfamiliar environments where prior maps and heavy onboard sensors are unavailable. To address this, we present an entropy-aware deep reinforcement learning framework that enables fixed-wing UAVs to navigate safely using only monocular onboard cameras. Our system features a lightweight, single-frame depth estimation module optimized for real-time execution on edge computing platforms, followed by a reinforcement learning controller equipped with a novel reward function that balances goal-reaching performance with path smoothness under fixed-wing dynamic constraints. To enhance policy optimization, we incorporate high-quality experiences from the replay buffer into the gradient computation, introducing a soft imitation mechanism that encourages the agent to align its behavior with previously successful actions. To further balance exploration and exploitation, we integrate an adaptive entropy regularization mechanism into the Proximal Policy Optimization (PPO) algorithm. This module dynamically adjusts policy entropy during training, leading to improved stability, faster convergence, and better generalization to unseen scenarios. Extensive software-in-the-loop (SITL) and hardware-in-the-loop (HITL) experiments demonstrate that our approach outperforms baseline methods in obstacle avoidance success rate and path quality, while remaining lightweight and deployable on resource-constrained aerial platforms.

源语言英语
文章编号598
期刊Drones
9
9
DOI
出版状态已出版 - 9月 2025

指纹

探究 'Lightweight Obstacle Avoidance for Fixed-Wing UAVs Using Entropy-Aware PPO' 的科研主题。它们共同构成独一无二的指纹。

引用此