TY - GEN
T1 - Reinforcement Learning-Based Intelligent Decision-Making for Infrared Decoy Deployment
AU - Mi, Yufeng
AU - Liu, Xubin
AU - Bi, Wenhao
AU - Zhang, An
AU - Han, Yanlong
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Infrared decoys are a critical terminal defense measure for airborne platforms against infrared-guided short-range air-to-air missiles, where the quality of the interference strategy largely determines whether an aircraft can successfully evade incoming threats. To overcome the limited adaptability of traditional rule-based strategies in complex threat scenarios, this paper proposes an intelligent decision-making method for infrared decoy countermeasures based on reinforcement learning. First, a simulation environment is constructed that integrates missile dynamics, aircraft maneuvers, infrared decoy motion, and centroid interference models. The interference decision problem is then formulated as a Markov decision process, with carefully designed state and action spaces as well as a tailored reward function. Finally, a Proximal Policy Optimization (PPO) algorithm is employed to train the decision-making agent, enabling it to autonomously learn optimal countermeasure strategies under various threat conditions. Simulation results demonstrate that, compared with conventional rule-based methods, the proposed approach increases the aircraft survival probability from 58% to 92% while satisfying the real-time requirements of terminal defense. This method provides an efficient and intelligent decision-support framework for airborne platforms facing infrared-guided missile threats.
AB - Infrared decoys are a critical terminal defense measure for airborne platforms against infrared-guided short-range air-to-air missiles, where the quality of the interference strategy largely determines whether an aircraft can successfully evade incoming threats. To overcome the limited adaptability of traditional rule-based strategies in complex threat scenarios, this paper proposes an intelligent decision-making method for infrared decoy countermeasures based on reinforcement learning. First, a simulation environment is constructed that integrates missile dynamics, aircraft maneuvers, infrared decoy motion, and centroid interference models. The interference decision problem is then formulated as a Markov decision process, with carefully designed state and action spaces as well as a tailored reward function. Finally, a Proximal Policy Optimization (PPO) algorithm is employed to train the decision-making agent, enabling it to autonomously learn optimal countermeasure strategies under various threat conditions. Simulation results demonstrate that, compared with conventional rule-based methods, the proposed approach increases the aircraft survival probability from 58% to 92% while satisfying the real-time requirements of terminal defense. This method provides an efficient and intelligent decision-support framework for airborne platforms facing infrared-guided missile threats.
KW - infrared decoy
KW - intelligent decision-making
KW - reinforcement learning
KW - terminal defense
UR - https://www.scopus.com/pages/publications/105033561465
U2 - 10.1109/ICACR68388.2025.11360036
DO - 10.1109/ICACR68388.2025.11360036
M3 - 会议稿件
AN - SCOPUS:105033561465
T3 - 2025 9th International Conference on Automation, Control and Robotics, ICACR 2025
SP - 89
EP - 93
BT - 2025 9th International Conference on Automation, Control and Robotics, ICACR 2025
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Automation, Control and Robotics, ICACR 2025
Y2 - 28 November 2025 through 30 November 2025
ER -