TY - GEN
T1 - Three-Dimensional Active Defendin Control Method Based on Heuristic Dynamic Programming
AU - Zhang, Bao
AU - Liu, Yang
AU - Yang, Zhen
AU - Li, Yicong
AU - Yang, Chunxiao
AU - Han, Qingceng
AU - Zhou, Deyun
N1 - Publisher Copyright:
© 2025 ICROS.
PY - 2025
Y1 - 2025
N2 - This paper addresses the multi-party optimal control problem of real-time nonlinear systems in the active defense process of the aircraft and defending missiles, proposing a Dual-Action-Dependent Heuristic Dynamic Programming Algorithm based on Three-Dimensional Zero-Control Miss Distance (DADHDP). The algorithm combines approximate dynamic programming with the Actor-Critic framework in reinforcement learning, effectively approximating the infinite-horizon optimal control problem of complex nonlinear systems by iterating dynamic adversarial information in real-time. Compared with traditional methods, DADHDP introduces three-dimensional zero-control miss distance, an Actor network structure with embedded saturation constraints, and a non-quadratic cost functional in the algorithm design, which maintains system stability under control input limitations (such as saturation constraints), enhances solution efficiency, and effectively avoids control failure due to constraint violations in traditional methods.
AB - This paper addresses the multi-party optimal control problem of real-time nonlinear systems in the active defense process of the aircraft and defending missiles, proposing a Dual-Action-Dependent Heuristic Dynamic Programming Algorithm based on Three-Dimensional Zero-Control Miss Distance (DADHDP). The algorithm combines approximate dynamic programming with the Actor-Critic framework in reinforcement learning, effectively approximating the infinite-horizon optimal control problem of complex nonlinear systems by iterating dynamic adversarial information in real-time. Compared with traditional methods, DADHDP introduces three-dimensional zero-control miss distance, an Actor network structure with embedded saturation constraints, and a non-quadratic cost functional in the algorithm design, which maintains system stability under control input limitations (such as saturation constraints), enhances solution efficiency, and effectively avoids control failure due to constraint violations in traditional methods.
KW - Active Defense
KW - Adaptive Dynamic Programming
KW - Guidance Law
KW - Zero-Control Miss Distance
UR - https://www.scopus.com/pages/publications/105031914162
U2 - 10.23919/ICCAS66577.2025.11301371
DO - 10.23919/ICCAS66577.2025.11301371
M3 - 会议稿件
AN - SCOPUS:105031914162
T3 - International Conference on Control, Automation and Systems
SP - 42
EP - 47
BT - 2025 25th International Conference on Control, Automation and Systems, ICCAS 2025
PB - IEEE Computer Society
T2 - 25th International Conference on Control, Automation and Systems, ICCAS 2025
Y2 - 4 November 2025 through 7 November 2025
ER -