TY - GEN
T1 - High-Frequency Limitation Deep Reinforcement Learning Based Energy Management Strategy for Dual Fuel Cell Electric Aircraft
AU - Shi, Wenzhuo
AU - Huangfu, Yigeng
AU - Pang, Shengzhao
AU - Xu, Liangcai
AU - Yuan, Cong
AU - Zhuo, Shengrong
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - As an important part of hybrid power source system, energy management strategy (EMS) can be modeled by Markov decision processes and solved by reinforcement learning. A high-frequency limitation deep deterministic policy gradient (DDPG) EMS for electric aircraft is proposed, which consists of two parallel proton exchange membrane fuel cells (PEMFCs) and one battery. DDPG is a classic reinforcement learning algorithm with the advantage of making decisions in a continuous action space. However, its actions have been unsatisfactory due to volatility. By adding high-frequency limitation, high-frequency limitation DDPG can make decisions that limit the output power of the PEMFC when the high-frequency scale is large. To validate the proposed EMS, it is compared with the conventional DDPG EMS, and the simulation results show that the proposed EMS significantly reduces the PEMFC fluctuation compared to the conventional DDPG. Under the same environment, the equivalent hydrogen consumption is 8g lower than that of DDPG, and standard deviations of PEMFC1's stress and PEMFC2's stress are reduced by 68.03% and 55.17%, respectively. In addition, its generalization ability is also verified by adjusting the initial state of charge and extending the operating conditions.
AB - As an important part of hybrid power source system, energy management strategy (EMS) can be modeled by Markov decision processes and solved by reinforcement learning. A high-frequency limitation deep deterministic policy gradient (DDPG) EMS for electric aircraft is proposed, which consists of two parallel proton exchange membrane fuel cells (PEMFCs) and one battery. DDPG is a classic reinforcement learning algorithm with the advantage of making decisions in a continuous action space. However, its actions have been unsatisfactory due to volatility. By adding high-frequency limitation, high-frequency limitation DDPG can make decisions that limit the output power of the PEMFC when the high-frequency scale is large. To validate the proposed EMS, it is compared with the conventional DDPG EMS, and the simulation results show that the proposed EMS significantly reduces the PEMFC fluctuation compared to the conventional DDPG. Under the same environment, the equivalent hydrogen consumption is 8g lower than that of DDPG, and standard deviations of PEMFC1's stress and PEMFC2's stress are reduced by 68.03% and 55.17%, respectively. In addition, its generalization ability is also verified by adjusting the initial state of charge and extending the operating conditions.
KW - Deep Deterministic Policy Gradient
KW - Dual Fuel Cell
KW - Energy management strategy
KW - High-Frequency Limitation
UR - http://www.scopus.com/inward/record.url?scp=85142792297&partnerID=8YFLogxK
U2 - 10.1109/IAS54023.2022.9939937
DO - 10.1109/IAS54023.2022.9939937
M3 - 会议稿件
AN - SCOPUS:85142792297
T3 - Conference Record - IAS Annual Meeting (IEEE Industry Applications Society)
BT - 2022 IEEE Industry Applications Society Annual Meeting, IAS 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE Industry Applications Society Annual Meeting, IAS 2022
Y2 - 9 October 2022 through 14 October 2022
ER -