TY - JOUR
T1 - Optimized Fuzzy Attitude Control of Quadrotor Unmanned Aerial Vehicle Using Adaptive Reinforcement Learning Strategy
AU - Wen, Guoxing
AU - Yu, Dengxiu
AU - Zhao, Yanlong
N1 - Publisher Copyright:
© 1965-2011 IEEE.
PY - 2024
Y1 - 2024
N2 - This article is to develop an optimized attitude control using reinforcement learning (RL) for a quadrotor unmanned aerial vehicle (QUAV) system. Since this optimized scheme is required to steer both attitude angle position and velocity states to follow the predefined reference signals, it is a very challenging and interesting work. In theory, an optimal control can be found via solving the Hamilton-Jacobi-Bellman (HJB) equation. Nevertheless, the HJB equation associated with the QUAV attitude dynamic equation is hardly solved via analytical methods owing to the complex nonlinearity. To address the problem, the RL strategy of critic-actor architecture using fuzzy logic system approximation is proposed. For smooth manipulating both attitude angle position and velocity states, the optimized control has to contain both tracking error terms corresponding to the angle position and velocity. Thus, if the optimized attitude control learns from the traditional methods, the control algorithm will be very intricate. Since this RL tuning laws are derived from the negative gradient of a simple positive function that is equivalent to the HJB equation, the control algorithm is significantly simplified. Finally, both theory proof and actual experiment demonstrate that this optimized attitude control is competent for fulfilling the control objective.
AB - This article is to develop an optimized attitude control using reinforcement learning (RL) for a quadrotor unmanned aerial vehicle (QUAV) system. Since this optimized scheme is required to steer both attitude angle position and velocity states to follow the predefined reference signals, it is a very challenging and interesting work. In theory, an optimal control can be found via solving the Hamilton-Jacobi-Bellman (HJB) equation. Nevertheless, the HJB equation associated with the QUAV attitude dynamic equation is hardly solved via analytical methods owing to the complex nonlinearity. To address the problem, the RL strategy of critic-actor architecture using fuzzy logic system approximation is proposed. For smooth manipulating both attitude angle position and velocity states, the optimized control has to contain both tracking error terms corresponding to the angle position and velocity. Thus, if the optimized attitude control learns from the traditional methods, the control algorithm will be very intricate. Since this RL tuning laws are derived from the negative gradient of a simple positive function that is equivalent to the HJB equation, the control algorithm is significantly simplified. Finally, both theory proof and actual experiment demonstrate that this optimized attitude control is competent for fulfilling the control objective.
KW - Fuzzy logic system (FLS)
KW - optimal control
KW - quadrotor attitude
KW - reinforcement learning (RL)
KW - unmanned aerial vehicle (UAV)
UR - http://www.scopus.com/inward/record.url?scp=85193499778&partnerID=8YFLogxK
U2 - 10.1109/TAES.2024.3401668
DO - 10.1109/TAES.2024.3401668
M3 - 文章
AN - SCOPUS:85193499778
SN - 0018-9251
VL - 60
SP - 6075
EP - 6083
JO - IEEE Transactions on Aerospace and Electronic Systems
JF - IEEE Transactions on Aerospace and Electronic Systems
IS - 5
ER -