Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States

Jinlong Wei; Nan Jiang; Wu Sun; Chengli Fan; Dengxiu Yu

doi:10.1109/NTCI64025.2024.10776100

School of Artificial Intelligence, Optics and Electronics

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

To address the needs of future intelligent warfare, this paper proposes a hierarchical intelligent decision-making algorithm (ATT-PPO) based on the PPO algorithm and multi-head attention mechanism, designed to solve air defense sequential decision-making problems under unstable dimensional states. Traditional reinforcement learning struggles to handle variable-dimensional observation information, whereas ATT-PPO uses the attention mechanism to map variable-dimensional observations into fixed dimensions, thereby enhancing the representation of high-dimensional dynamic data. However, high-dimensional decision spaces increase the learning burden of the algorithm. To mitigate this, a hierarchical reinforcement learning architecture is designed, incorporating domain knowledge to narrow the exploration space and improve decision-making efficiency. Simulation results show that ATT-PPO significantly outperforms traditional expert control methods in cumulative rewards, validating its superior performance.
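
The abstract does not give the ATT-PPO network details, so the following is only a minimal sketch of the general idea it names: multi-head attention pooling that turns an observation with a varying number of entities into a fixed-length vector a policy network could consume. The function `attention_pool`, the learned per-head `query`, the projection matrices `w_k` and `w_v`, and all sizes are assumptions chosen for illustration, not the authors' implementation.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def attention_pool(entities, query, w_k, w_v):
    """Pool a variable number of entity vectors into a fixed-size vector.

    entities: (n, d_in) array; n may differ from step to step (n >= 1).
    query:    (heads, d_k) array, one hypothetical learned query per head.
    w_k, w_v: (heads, d_in, d_k) key and value projections (hypothetical).
    Returns a flat vector of length heads * d_k, independent of n.
    """
    keys = np.einsum("nd,hdk->hnk", entities, w_k)      # (heads, n, d_k)
    values = np.einsum("nd,hdk->hnk", entities, w_v)    # (heads, n, d_k)
    scores = np.einsum("hk,hnk->hn", query, keys)       # (heads, n)
    weights = softmax(scores / np.sqrt(query.shape[-1]), axis=-1)
    pooled = np.einsum("hn,hnk->hk", weights, values)   # (heads, d_k)
    return pooled.reshape(-1)


# Illustrative sizes only: 4 heads, 6 features per entity, 8 dims per head.
rng = np.random.default_rng(0)
heads, d_in, d_k = 4, 6, 8
query = rng.normal(size=(heads, d_k))
w_k = rng.normal(size=(heads, d_in, d_k))
w_v = rng.normal(size=(heads, d_in, d_k))

for n in (3, 7):  # e.g. 3 tracked targets at one step, 7 at another
    obs = rng.normal(size=(n, d_in))
    print(n, attention_pool(obs, query, w_k, w_v).shape)
```

Because the softmax weights sum over the entity axis, the output length depends only on `heads * d_k`, and the result does not depend on the order in which entities are listed. In a trained model the query and projections would be learned jointly with the policy rather than sampled at random.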

Original language	English
Title of host publication	Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024
Editors	Jian Wang, Witold Pedrycz
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	366-371
Number of pages	6
ISBN (Electronic)	9798331517021
DOIs	https://doi.org/10.1109/NTCI64025.2024.10776100
State	Published - 2024
Event	2024 International Conference on New Trends in Computational Intelligence, NTCI 2024 - Qingdao, China Duration: 18 Oct 2024 → 20 Oct 2024

Publication series

Name	Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024

Conference

Conference	2024 International Conference on New Trends in Computational Intelligence, NTCI 2024
Country/Territory	China
City	Qingdao
Period	18/10/24 → 20/10/24

Keywords

attention mechanism
dynamic observation
Intelligent combat
PPO algorithm

Cite this

Wei, J., Jiang, N., Sun, W., Fan, C., & Yu, D. (2024). Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States. In J. Wang & W. Pedrycz (Eds.), Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024 (pp. 366-371). (Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/NTCI64025.2024.10776100

Wei, Jinlong ; Jiang, Nan ; Sun, Wu et al. / Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States. Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024. editor / Jian Wang ; Witold Pedrycz. Institute of Electrical and Electronics Engineers Inc., 2024. pp. 366-371 (Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024).

@inproceedings{2bdb14673db04d7ea740a76824573ed3,

title = "Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States",

abstract = "To address the needs of future intelligent warfare, this paper proposes a hierarchical intelligent decision-making algorithm (ATT-PPO) based on the PPO algorithm and multi-head attention mechanism, designed to solve air defense sequential decision-making problems under unstable dimensional states. Traditional reinforcement learning struggles to handle variable-dimensional observation information, whereas ATT-PPO uses the attention mechanism to map variable-dimensional observations into fixed dimensions, thereby enhancing the representation of high-dimensional dynamic data. However, high-dimensional decision spaces increase the learning burden of the algorithm. To mitigate this, a hierarchical reinforcement learning architecture is designed, incorporating domain knowledge to narrow the exploration space and improve decision-making efficiency. Simulation results show that ATT-PPO significantly outperforms traditional expert control methods in cumulative rewards, validating its superior performance.",

keywords = "attention mechanism, dynamic observation, Intelligent combat, PPO algorithm",

author = "Jinlong Wei and Nan Jiang and Wu Sun and Chengli Fan and Dengxiu Yu",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024 ; Conference date: 18-10-2024 Through 20-10-2024",

year = "2024",

doi = "10.1109/NTCI64025.2024.10776100",

language = "English",

series = "Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "366--371",

editor = "Jian Wang and Witold Pedrycz",

booktitle = "Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024",

}

Wei, J, Jiang, N, Sun, W, Fan, C & Yu, D 2024, Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States. in J Wang & W Pedrycz (eds), Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024. Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024, Institute of Electrical and Electronics Engineers Inc., pp. 366-371, 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024, Qingdao, China, 18/10/24. https://doi.org/10.1109/NTCI64025.2024.10776100

Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States. / Wei, Jinlong; Jiang, Nan; Sun, Wu et al.
Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024. ed. / Jian Wang; Witold Pedrycz. Institute of Electrical and Electronics Engineers Inc., 2024. p. 366-371 (Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024).

TY - GEN

T1 - Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States

AU - Wei, Jinlong

AU - Jiang, Nan

AU - Sun, Wu

AU - Fan, Chengli

AU - Yu, Dengxiu

PY - 2024

Y1 - 2024

N2 - To address the needs of future intelligent warfare, this paper proposes a hierarchical intelligent decision-making algorithm (ATT-PPO) based on the PPO algorithm and multi-head attention mechanism, designed to solve air defense sequential decision-making problems under unstable dimensional states. Traditional reinforcement learning struggles to handle variable-dimensional observation information, whereas ATT-PPO uses the attention mechanism to map variable-dimensional observations into fixed dimensions, thereby enhancing the representation of high-dimensional dynamic data. However, high-dimensional decision spaces increase the learning burden of the algorithm. To mitigate this, a hierarchical reinforcement learning architecture is designed, incorporating domain knowledge to narrow the exploration space and improve decision-making efficiency. Simulation results show that ATT-PPO significantly outperforms traditional expert control methods in cumulative rewards, validating its superior performance.

AB - To address the needs of future intelligent warfare, this paper proposes a hierarchical intelligent decision-making algorithm (ATT-PPO) based on the PPO algorithm and multi-head attention mechanism, designed to solve air defense sequential decision-making problems under unstable dimensional states. Traditional reinforcement learning struggles to handle variable-dimensional observation information, whereas ATT-PPO uses the attention mechanism to map variable-dimensional observations into fixed dimensions, thereby enhancing the representation of high-dimensional dynamic data. However, high-dimensional decision spaces increase the learning burden of the algorithm. To mitigate this, a hierarchical reinforcement learning architecture is designed, incorporating domain knowledge to narrow the exploration space and improve decision-making efficiency. Simulation results show that ATT-PPO significantly outperforms traditional expert control methods in cumulative rewards, validating its superior performance.

KW - attention mechanism

KW - dynamic observation

KW - Intelligent combat

KW - PPO algorithm

UR - http://www.scopus.com/inward/record.url?scp=85215079032&partnerID=8YFLogxK

U2 - 10.1109/NTCI64025.2024.10776100

DO - 10.1109/NTCI64025.2024.10776100

M3 - Conference contribution

AN - SCOPUS:85215079032

T3 - Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024

SP - 366

EP - 371

BT - Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024

A2 - Wang, Jian

A2 - Pedrycz, Witold

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024

Y2 - 18 October 2024 through 20 October 2024

ER -

Wei J, Jiang N, Sun W, Fan C, Yu D. Efficient Air Defense Temporal Decision-Making Methods Under Unstable Dimensional States. In Wang J, Pedrycz W, editors, Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024. Institute of Electrical and Electronics Engineers Inc. 2024. p. 366-371. (Proceedings of 2024 International Conference on New Trends in Computational Intelligence, NTCI 2024). doi: 10.1109/NTCI64025.2024.10776100
