Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems: A Q-Learning Method

Zixuan Zheng; Peng Zhang; Jianping Yuan

doi:10.1109/TAES.2023.3235873

Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems: A Q-Learning Method

Zixuan Zheng, Peng Zhang, Jianping Yuan

School of Astronautics

Research output: Contribution to journal › Article › peer-review

29 Scopus citations

Abstract

In this article, the nonzero-sum pursuit-evasion (PE) game control problem is studied for a class of linear spacecraft control systems subject to the complete information case and incomplete information case. The incomplete information includes the cost functions and the control inputs. In practical confrontation situations, due to the incomplete information constraints, it is impossible for the pursuer and the evader to build up the exact opposite cost function. Hence, a nonzero-sum game framework is utilized to describe the PE game problem of the double-spacecraft system. First of all, under the complete information case, the nonzero-sum PE game control strategy is designed by solving the coupled Riccati recursions. Then, aiming at the incomplete information case, a control gain estimator is established, which lays the foundation for the control strategy design of the pursuit spacecraft. On the basis of the estimated control gain, the pursuit control strategy is solved by using the standard discrete-time Riccati recursion. In order to further get rid of the system information, a Q-learning-based control gain is designed for the pursuit spacecraft. Finally, a numerical example on the PE spacecraft system is provided to verify the effectiveness of the proposed PE game control strategies.

Original language	English
Pages (from-to)	3971-3981
Number of pages	11
Journal	IEEE Transactions on Aerospace and Electronic Systems
Volume	59
Issue number	4
DOIs	https://doi.org/10.1109/TAES.2023.3235873
State	Published - 1 Aug 2023

Keywords

Incomplete information
nonzero-sum game
pursuit-evasion game
q-learning
spacecraft control systems

Access to Document

10.1109/TAES.2023.3235873

Cite this

@article{99f9f8c62cba451cbd93fe743a70d07f,

title = "Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems: A Q-Learning Method",

abstract = "In this article, the nonzero-sum pursuit-evasion (PE) game control problem is studied for a class of linear spacecraft control systems subject to the complete information case and incomplete information case. The incomplete information includes the cost functions and the control inputs. In practical confrontation situations, due to the incomplete information constraints, it is impossible for the pursuer and the evader to build up the exact opposite cost function. Hence, a nonzero-sum game framework is utilized to describe the PE game problem of the double-spacecraft system. First of all, under the complete information case, the nonzero-sum PE game control strategy is designed by solving the coupled Riccati recursions. Then, aiming at the incomplete information case, a control gain estimator is established, which lays the foundation for the control strategy design of the pursuit spacecraft. On the basis of the estimated control gain, the pursuit control strategy is solved by using the standard discrete-time Riccati recursion. In order to further get rid of the system information, a Q-learning-based control gain is designed for the pursuit spacecraft. Finally, a numerical example on the PE spacecraft system is provided to verify the effectiveness of the proposed PE game control strategies.",

keywords = "Incomplete information, nonzero-sum game, pursuit-evasion game, q-learning, spacecraft control systems",

author = "Zixuan Zheng and Peng Zhang and Jianping Yuan",

note = "Publisher Copyright: {\textcopyright} 1965-2011 IEEE.",

year = "2023",

month = aug,

day = "1",

doi = "10.1109/TAES.2023.3235873",

language = "英语",

volume = "59",

pages = "3971--3981",

journal = "IEEE Transactions on Aerospace and Electronic Systems",

issn = "0018-9251",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "4",

}

TY - JOUR

T1 - Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems

T2 - A Q-Learning Method

AU - Zheng, Zixuan

AU - Zhang, Peng

AU - Yuan, Jianping

PY - 2023/8/1

Y1 - 2023/8/1

N2 - In this article, the nonzero-sum pursuit-evasion (PE) game control problem is studied for a class of linear spacecraft control systems subject to the complete information case and incomplete information case. The incomplete information includes the cost functions and the control inputs. In practical confrontation situations, due to the incomplete information constraints, it is impossible for the pursuer and the evader to build up the exact opposite cost function. Hence, a nonzero-sum game framework is utilized to describe the PE game problem of the double-spacecraft system. First of all, under the complete information case, the nonzero-sum PE game control strategy is designed by solving the coupled Riccati recursions. Then, aiming at the incomplete information case, a control gain estimator is established, which lays the foundation for the control strategy design of the pursuit spacecraft. On the basis of the estimated control gain, the pursuit control strategy is solved by using the standard discrete-time Riccati recursion. In order to further get rid of the system information, a Q-learning-based control gain is designed for the pursuit spacecraft. Finally, a numerical example on the PE spacecraft system is provided to verify the effectiveness of the proposed PE game control strategies.

AB - In this article, the nonzero-sum pursuit-evasion (PE) game control problem is studied for a class of linear spacecraft control systems subject to the complete information case and incomplete information case. The incomplete information includes the cost functions and the control inputs. In practical confrontation situations, due to the incomplete information constraints, it is impossible for the pursuer and the evader to build up the exact opposite cost function. Hence, a nonzero-sum game framework is utilized to describe the PE game problem of the double-spacecraft system. First of all, under the complete information case, the nonzero-sum PE game control strategy is designed by solving the coupled Riccati recursions. Then, aiming at the incomplete information case, a control gain estimator is established, which lays the foundation for the control strategy design of the pursuit spacecraft. On the basis of the estimated control gain, the pursuit control strategy is solved by using the standard discrete-time Riccati recursion. In order to further get rid of the system information, a Q-learning-based control gain is designed for the pursuit spacecraft. Finally, a numerical example on the PE spacecraft system is provided to verify the effectiveness of the proposed PE game control strategies.

KW - Incomplete information

KW - nonzero-sum game

KW - pursuit-evasion game

KW - q-learning

KW - spacecraft control systems

UR - http://www.scopus.com/inward/record.url?scp=85147219515&partnerID=8YFLogxK

U2 - 10.1109/TAES.2023.3235873

DO - 10.1109/TAES.2023.3235873

M3 - 文章

AN - SCOPUS:85147219515

SN - 0018-9251

VL - 59

SP - 3971

EP - 3981

JO - IEEE Transactions on Aerospace and Electronic Systems

JF - IEEE Transactions on Aerospace and Electronic Systems

IS - 4

ER -

Nonzero-Sum Pursuit-Evasion Game Control for Spacecraft Systems: A Q-Learning Method

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this