TY - GEN
T1 - Airborne Multi-platform Sensor Scheduling Based on Reinforcement Learning
AU - Wang, Yuedong
AU - He, Jing
AU - Yan, Shi
AU - Liang, Yan
N1 - Publisher Copyright:
© 2022, The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
PY - 2022
Y1 - 2022
N2 - In the modern battle, information acquisition is the key for combat success and the reconnaissance is one of the main measures. Aiming at the systematic development of combat units, the reconnaissance mission is usually achieved by multi-platform cooperation. Airborne sensors, as the essential equipment to obtain battlefield information, are coordinated effectively for reaching the operation aim. There are two types of cooperative control strategies, short-sighted and non-short-sighted ones. In the process of strategy optimization, the former only aims to maximize the current immediate return, but ignores the long-term return. In addition, active sensors continuously radiate electromagnetic waves outward when obtaining continuous measurement, which is easy to expose their own position. Therefore, how to improve their ability to survive is particularly important. To this end, considering the target threat, the airborne multi-platform collaborative detection method is proposed based on reinforcement learning, which takes into account the current immediate return as well as the future long-term return, and aims to maximize information perception under the premise of self-security. The simulation tests demonstrate the effectiveness of this method.
AB - In the modern battle, information acquisition is the key for combat success and the reconnaissance is one of the main measures. Aiming at the systematic development of combat units, the reconnaissance mission is usually achieved by multi-platform cooperation. Airborne sensors, as the essential equipment to obtain battlefield information, are coordinated effectively for reaching the operation aim. There are two types of cooperative control strategies, short-sighted and non-short-sighted ones. In the process of strategy optimization, the former only aims to maximize the current immediate return, but ignores the long-term return. In addition, active sensors continuously radiate electromagnetic waves outward when obtaining continuous measurement, which is easy to expose their own position. Therefore, how to improve their ability to survive is particularly important. To this end, considering the target threat, the airborne multi-platform collaborative detection method is proposed based on reinforcement learning, which takes into account the current immediate return as well as the future long-term return, and aims to maximize information perception under the premise of self-security. The simulation tests demonstrate the effectiveness of this method.
KW - Coordinated control
KW - Multi-agent systems
KW - Reinforcement learning control
KW - Sensor scheduling
KW - Task allocation
UR - http://www.scopus.com/inward/record.url?scp=85120620480&partnerID=8YFLogxK
U2 - 10.1007/978-981-15-8155-7_172
DO - 10.1007/978-981-15-8155-7_172
M3 - 会议稿件
AN - SCOPUS:85120620480
SN - 9789811581540
T3 - Lecture Notes in Electrical Engineering
SP - 2049
EP - 2059
BT - Advances in Guidance, Navigation and Control - Proceedings of 2020 International Conference on Guidance, Navigation and Control, ICGNC 2020
A2 - Yan, Liang
A2 - Duan, Haibin
A2 - Yu, Xiang
PB - Springer Science and Business Media Deutschland GmbH
T2 - International Conference on Guidance, Navigation and Control, ICGNC 2020
Y2 - 23 October 2020 through 25 October 2020
ER -