TY - JOUR
T1 - Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems
T2 - A Q-Learning Method
AU - Zheng, Zixuan
AU - Zhang, Peng
AU - Yuan, Jianping
N1 - Publisher Copyright:
© 1965-2011 IEEE.
PY - 2023/8/1
Y1 - 2023/8/1
N2 - In this article, the nonzero-sum pursuit-evasion (PE) game control problem is studied for a class of linear spacecraft control systems subject to the complete information case and incomplete information case. The incomplete information includes the cost functions and the control inputs. In practical confrontation situations, due to the incomplete information constraints, it is impossible for the pursuer and the evader to build up the exact opposite cost function. Hence, a nonzero-sum game framework is utilized to describe the PE game problem of the double-spacecraft system. First of all, under the complete information case, the nonzero-sum PE game control strategy is designed by solving the coupled Riccati recursions. Then, aiming at the incomplete information case, a control gain estimator is established, which lays the foundation for the control strategy design of the pursuit spacecraft. On the basis of the estimated control gain, the pursuit control strategy is solved by using the standard discrete-time Riccati recursion. In order to further get rid of the system information, a Q-learning-based control gain is designed for the pursuit spacecraft. Finally, a numerical example on the PE spacecraft system is provided to verify the effectiveness of the proposed PE game control strategies.
AB - In this article, the nonzero-sum pursuit-evasion (PE) game control problem is studied for a class of linear spacecraft control systems subject to the complete information case and incomplete information case. The incomplete information includes the cost functions and the control inputs. In practical confrontation situations, due to the incomplete information constraints, it is impossible for the pursuer and the evader to build up the exact opposite cost function. Hence, a nonzero-sum game framework is utilized to describe the PE game problem of the double-spacecraft system. First of all, under the complete information case, the nonzero-sum PE game control strategy is designed by solving the coupled Riccati recursions. Then, aiming at the incomplete information case, a control gain estimator is established, which lays the foundation for the control strategy design of the pursuit spacecraft. On the basis of the estimated control gain, the pursuit control strategy is solved by using the standard discrete-time Riccati recursion. In order to further get rid of the system information, a Q-learning-based control gain is designed for the pursuit spacecraft. Finally, a numerical example on the PE spacecraft system is provided to verify the effectiveness of the proposed PE game control strategies.
KW - Incomplete information
KW - nonzero-sum game
KW - pursuit-evasion game
KW - q-learning
KW - spacecraft control systems
UR - http://www.scopus.com/inward/record.url?scp=85147219515&partnerID=8YFLogxK
U2 - 10.1109/TAES.2023.3235873
DO - 10.1109/TAES.2023.3235873
M3 - 文章
AN - SCOPUS:85147219515
SN - 0018-9251
VL - 59
SP - 3971
EP - 3981
JO - IEEE Transactions on Aerospace and Electronic Systems
JF - IEEE Transactions on Aerospace and Electronic Systems
IS - 4
ER -