An end-to-end Flight Control Method for UAVs Based on MD-SAC

Chao Song; Yi Zhang; Shuangxia Bai; Bo Li; Zhigang Gan; Evgeny Neretin

doi:10.1109/TCE.2025.3541747

An end-to-end Flight Control Method for UAVs Based on MD-SAC

Chao Song, Yi Zhang, Shuangxia Bai, Bo Li, Zhigang Gan, Evgeny Neretin

School of Electronics and Information

Research output: Contribution to journal › Article › peer-review

Abstract

Deep reinforcement learning (DRL) allows unmanned aerial vehicles (UAVs) to learn control policies for tasks in complicated and unfamiliar environments, hence it is widely employed in the field of UAV flight control. However, the model and operational environment of UAVs are typically simplified, rendering them unrepresentative of the real world. Furthermore, using only a single sensory data to control UAV flight is difficult to realize autonomous decision-making of UAVs. In this paper, an end-to-end flight control method for UAVs based on multimodal data fusion and Soft Actor-Critic (SAC) algorithm is proposed, named MD-SAC. First, this paper constructs the UAV model that is basically consistent with the real physical model, and forms a UAV multidata fusion state space including UAV information, UAV and target information and UAV sensor sensing information. Then, the strategy of directly mapping the multimodal data fusion results to the UAV torque and thrust is proposed to construct an end-to-end UAV hierarchical control model, and the convergence of the control method is accelerated based on the empirical playback mechanism. The experimental results show that the UAV based on the MD-SAC algorithm can effectively complete autonomous trajectory planning and adapt to a variety of complex environments, and the performance is improved in terms of robustness and generalization compared with the PPO algorithm and the optimized SAC algorithm.

Original language	English
Journal	IEEE Transactions on Consumer Electronics
DOIs	https://doi.org/10.1109/TCE.2025.3541747
State	Accepted/In press - 2025

Keywords

Deep reinforcement learning
Multimodal data fusion
Optimising SAC algorithm
Perception and autonomy

Access to Document

10.1109/TCE.2025.3541747

Cite this

@article{6a44980d2c2944f791d53cf4997bccff,

title = "An end-to-end Flight Control Method for UAVs Based on MD-SAC",

abstract = "Deep reinforcement learning (DRL) allows unmanned aerial vehicles (UAVs) to learn control policies for tasks in complicated and unfamiliar environments, hence it is widely employed in the field of UAV flight control. However, the model and operational environment of UAVs are typically simplified, rendering them unrepresentative of the real world. Furthermore, using only a single sensory data to control UAV flight is difficult to realize autonomous decision-making of UAVs. In this paper, an end-to-end flight control method for UAVs based on multimodal data fusion and Soft Actor-Critic (SAC) algorithm is proposed, named MD-SAC. First, this paper constructs the UAV model that is basically consistent with the real physical model, and forms a UAV multidata fusion state space including UAV information, UAV and target information and UAV sensor sensing information. Then, the strategy of directly mapping the multimodal data fusion results to the UAV torque and thrust is proposed to construct an end-to-end UAV hierarchical control model, and the convergence of the control method is accelerated based on the empirical playback mechanism. The experimental results show that the UAV based on the MD-SAC algorithm can effectively complete autonomous trajectory planning and adapt to a variety of complex environments, and the performance is improved in terms of robustness and generalization compared with the PPO algorithm and the optimized SAC algorithm.",

keywords = "Deep reinforcement learning, Multimodal data fusion, Optimising SAC algorithm, Perception and autonomy",

author = "Chao Song and Yi Zhang and Shuangxia Bai and Bo Li and Zhigang Gan and Evgeny Neretin",

year = "2025",

doi = "10.1109/TCE.2025.3541747",

language = "英语",

journal = "IEEE Transactions on Consumer Electronics",

issn = "0098-3063",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - An end-to-end Flight Control Method for UAVs Based on MD-SAC

AU - Song, Chao

AU - Zhang, Yi

AU - Bai, Shuangxia

AU - Li, Bo

AU - Gan, Zhigang

AU - Neretin, Evgeny

PY - 2025

Y1 - 2025

N2 - Deep reinforcement learning (DRL) allows unmanned aerial vehicles (UAVs) to learn control policies for tasks in complicated and unfamiliar environments, hence it is widely employed in the field of UAV flight control. However, the model and operational environment of UAVs are typically simplified, rendering them unrepresentative of the real world. Furthermore, using only a single sensory data to control UAV flight is difficult to realize autonomous decision-making of UAVs. In this paper, an end-to-end flight control method for UAVs based on multimodal data fusion and Soft Actor-Critic (SAC) algorithm is proposed, named MD-SAC. First, this paper constructs the UAV model that is basically consistent with the real physical model, and forms a UAV multidata fusion state space including UAV information, UAV and target information and UAV sensor sensing information. Then, the strategy of directly mapping the multimodal data fusion results to the UAV torque and thrust is proposed to construct an end-to-end UAV hierarchical control model, and the convergence of the control method is accelerated based on the empirical playback mechanism. The experimental results show that the UAV based on the MD-SAC algorithm can effectively complete autonomous trajectory planning and adapt to a variety of complex environments, and the performance is improved in terms of robustness and generalization compared with the PPO algorithm and the optimized SAC algorithm.

AB - Deep reinforcement learning (DRL) allows unmanned aerial vehicles (UAVs) to learn control policies for tasks in complicated and unfamiliar environments, hence it is widely employed in the field of UAV flight control. However, the model and operational environment of UAVs are typically simplified, rendering them unrepresentative of the real world. Furthermore, using only a single sensory data to control UAV flight is difficult to realize autonomous decision-making of UAVs. In this paper, an end-to-end flight control method for UAVs based on multimodal data fusion and Soft Actor-Critic (SAC) algorithm is proposed, named MD-SAC. First, this paper constructs the UAV model that is basically consistent with the real physical model, and forms a UAV multidata fusion state space including UAV information, UAV and target information and UAV sensor sensing information. Then, the strategy of directly mapping the multimodal data fusion results to the UAV torque and thrust is proposed to construct an end-to-end UAV hierarchical control model, and the convergence of the control method is accelerated based on the empirical playback mechanism. The experimental results show that the UAV based on the MD-SAC algorithm can effectively complete autonomous trajectory planning and adapt to a variety of complex environments, and the performance is improved in terms of robustness and generalization compared with the PPO algorithm and the optimized SAC algorithm.

KW - Deep reinforcement learning

KW - Multimodal data fusion

KW - Optimising SAC algorithm

KW - Perception and autonomy

UR - http://www.scopus.com/inward/record.url?scp=85217913375&partnerID=8YFLogxK

U2 - 10.1109/TCE.2025.3541747

DO - 10.1109/TCE.2025.3541747

M3 - 文章

AN - SCOPUS:85217913375

SN - 0098-3063

JO - IEEE Transactions on Consumer Electronics

JF - IEEE Transactions on Consumer Electronics

ER -

An end-to-end Flight Control Method for UAVs Based on MD-SAC

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this