Distributed Electric Propulsion Aircraft Attitude Control Based on Assisted Training Multi-Agent Reinforcement Learning

Shengzhao Pang; Heng Liu; Yingxue Chen; Bo Cheng; Zhaoyong Mao; Yigeng Huangfu

doi:10.1109/TTE.2025.3576166

Distributed Electric Propulsion Aircraft Attitude Control Based on Assisted Training Multi-Agent Reinforcement Learning

Shengzhao Pang, Heng Liu, Yingxue Chen, Bo Cheng, Zhaoyong Mao, Yigeng Huangfu

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

Abstract

Distributed Electric Propulsion (DEP) aircraft generate differential thrust by rotational speed differential between distributed electric propulsors. This creates additional control torque, which offers the potential for improved flight performance. However, DEP aircraft is a highly coupled and complex system with multiple inputs and outputs. It is crucial to ensure that the different controlled actuators (distributed electric propulsors, rudder, elevator, and ailerons) work in a coordinated manner. To this end, this paper proposes a Memorable Fuzzy Assisted Training Twin Delayed Deep Deterministic Policy Gradient (MFAT-TD3) algorithm. The MFAT-TD3 algorithm reduces the training time and increases the rate of convergence of the reinforcement learning algorithm through a method of decoupling first and then coupling. This also enhances the generalization performance of individual agents, thus achieving the coordination and cooperation between different controlled actuators of the DEP aircraft, so that the aircraft can fly with high agility. The results show that the proposed algorithm can reduce training time, improve flight efficiency and fault tolerance. Even when there are multiple propulsors failed, it is still able to accomplish the combined power yaw control.

Original language	English
Journal	IEEE Transactions on Transportation Electrification
DOIs	https://doi.org/10.1109/TTE.2025.3576166
State	Accepted/In press - 2025

Keywords

attitude control
Distributed electric propulsion
reinforcement learning
twin delayed deep deterministic policy gradient
yaw control

Access to Document

10.1109/TTE.2025.3576166

Cite this

@article{c4787055b67048c2bc707f8ba0eb2d12,

title = "Distributed Electric Propulsion Aircraft Attitude Control Based on Assisted Training Multi-Agent Reinforcement Learning",

abstract = "Distributed Electric Propulsion (DEP) aircraft generate differential thrust by rotational speed differential between distributed electric propulsors. This creates additional control torque, which offers the potential for improved flight performance. However, DEP aircraft is a highly coupled and complex system with multiple inputs and outputs. It is crucial to ensure that the different controlled actuators (distributed electric propulsors, rudder, elevator, and ailerons) work in a coordinated manner. To this end, this paper proposes a Memorable Fuzzy Assisted Training Twin Delayed Deep Deterministic Policy Gradient (MFAT-TD3) algorithm. The MFAT-TD3 algorithm reduces the training time and increases the rate of convergence of the reinforcement learning algorithm through a method of decoupling first and then coupling. This also enhances the generalization performance of individual agents, thus achieving the coordination and cooperation between different controlled actuators of the DEP aircraft, so that the aircraft can fly with high agility. The results show that the proposed algorithm can reduce training time, improve flight efficiency and fault tolerance. Even when there are multiple propulsors failed, it is still able to accomplish the combined power yaw control.",

keywords = "attitude control, Distributed electric propulsion, reinforcement learning, twin delayed deep deterministic policy gradient, yaw control",

author = "Shengzhao Pang and Heng Liu and Yingxue Chen and Bo Cheng and Zhaoyong Mao and Yigeng Huangfu",

note = "Publisher Copyright: {\textcopyright} 2015 IEEE.",

year = "2025",

doi = "10.1109/TTE.2025.3576166",

language = "英语",

journal = "IEEE Transactions on Transportation Electrification",

issn = "2332-7782",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Distributed Electric Propulsion Aircraft Attitude Control Based on Assisted Training Multi-Agent Reinforcement Learning

AU - Pang, Shengzhao

AU - Liu, Heng

AU - Chen, Yingxue

AU - Cheng, Bo

AU - Mao, Zhaoyong

AU - Huangfu, Yigeng

PY - 2025

Y1 - 2025

N2 - Distributed Electric Propulsion (DEP) aircraft generate differential thrust by rotational speed differential between distributed electric propulsors. This creates additional control torque, which offers the potential for improved flight performance. However, DEP aircraft is a highly coupled and complex system with multiple inputs and outputs. It is crucial to ensure that the different controlled actuators (distributed electric propulsors, rudder, elevator, and ailerons) work in a coordinated manner. To this end, this paper proposes a Memorable Fuzzy Assisted Training Twin Delayed Deep Deterministic Policy Gradient (MFAT-TD3) algorithm. The MFAT-TD3 algorithm reduces the training time and increases the rate of convergence of the reinforcement learning algorithm through a method of decoupling first and then coupling. This also enhances the generalization performance of individual agents, thus achieving the coordination and cooperation between different controlled actuators of the DEP aircraft, so that the aircraft can fly with high agility. The results show that the proposed algorithm can reduce training time, improve flight efficiency and fault tolerance. Even when there are multiple propulsors failed, it is still able to accomplish the combined power yaw control.

AB - Distributed Electric Propulsion (DEP) aircraft generate differential thrust by rotational speed differential between distributed electric propulsors. This creates additional control torque, which offers the potential for improved flight performance. However, DEP aircraft is a highly coupled and complex system with multiple inputs and outputs. It is crucial to ensure that the different controlled actuators (distributed electric propulsors, rudder, elevator, and ailerons) work in a coordinated manner. To this end, this paper proposes a Memorable Fuzzy Assisted Training Twin Delayed Deep Deterministic Policy Gradient (MFAT-TD3) algorithm. The MFAT-TD3 algorithm reduces the training time and increases the rate of convergence of the reinforcement learning algorithm through a method of decoupling first and then coupling. This also enhances the generalization performance of individual agents, thus achieving the coordination and cooperation between different controlled actuators of the DEP aircraft, so that the aircraft can fly with high agility. The results show that the proposed algorithm can reduce training time, improve flight efficiency and fault tolerance. Even when there are multiple propulsors failed, it is still able to accomplish the combined power yaw control.

KW - attitude control

KW - Distributed electric propulsion

KW - reinforcement learning

KW - twin delayed deep deterministic policy gradient

KW - yaw control

UR - http://www.scopus.com/inward/record.url?scp=105007304216&partnerID=8YFLogxK

U2 - 10.1109/TTE.2025.3576166

DO - 10.1109/TTE.2025.3576166

M3 - 文章

AN - SCOPUS:105007304216

SN - 2332-7782

JO - IEEE Transactions on Transportation Electrification

JF - IEEE Transactions on Transportation Electrification

ER -

Distributed Electric Propulsion Aircraft Attitude Control Based on Assisted Training Multi-Agent Reinforcement Learning

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this