Hierarchical Reinforcement-Learning-Based Joint Allocation of Jamming Task and Power for Countering Networked Radar

Yuedong Wang; Yan Liang; Zengfu Wang

doi:10.1109/TAES.2024.3467041

Hierarchical Reinforcement-Learning-Based Joint Allocation of Jamming Task and Power for Countering Networked Radar

Yuedong Wang, Yan Liang, Zengfu Wang

自动化学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

The detection fusion and antijamming of a networked radar (NR) create a significant dynamic game between the NR and the jammer, making immediate jamming strategies that maximize the current jamming benefit unsuitable. Therefore, this article proposes a long-term joint optimization problem of jamming tasks and power allocation in the NR antijamming fusion game. Specifically, a jammer is utilized to disrupt the joint detection capability of the NR, which possesses detection fusion and jamming suppression mechanisms. By simulating task decomposition and hierarchical control ideas from human decision-making processes, a hierarchical reinforcement-learning-based jamming resource allocation scheme is established. This scheme designs a hierarchical policy network with a shared evaluation network, achieving joint optimization of hybrid discrete (jamming task) and continuous (jamming power) control variables. The value loss, top-level policy loss, and low-level policy loss, which are constructed based on the total reward, are optimized to update the parameters of the evaluation network and the hierarchical policy network, thereby improving allocation strategies. Moreover, the state features and total reward are suitably designed based on the jamming mission to aid the jammer's strategy exploration. Finally, our approach is compared with State-of-the-Art deep reinforcement learning algorithms and timely optimization methods, demonstrating superior jamming performance and shorter decision-making time under typical parameters considering radar deployment and formation motion.

源语言	英语
页（从-至）	2149-2167
页数	19
期刊	IEEE Transactions on Aerospace and Electronic Systems
卷	61
期	2
DOI	https://doi.org/10.1109/TAES.2024.3467041
出版状态	已出版 - 2025

访问文件

10.1109/TAES.2024.3467041

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{2ed2e6b46b5c471bacadd1242a2465e5,

title = "Hierarchical Reinforcement-Learning-Based Joint Allocation of Jamming Task and Power for Countering Networked Radar",

abstract = "The detection fusion and antijamming of a networked radar (NR) create a significant dynamic game between the NR and the jammer, making immediate jamming strategies that maximize the current jamming benefit unsuitable. Therefore, this article proposes a long-term joint optimization problem of jamming tasks and power allocation in the NR antijamming fusion game. Specifically, a jammer is utilized to disrupt the joint detection capability of the NR, which possesses detection fusion and jamming suppression mechanisms. By simulating task decomposition and hierarchical control ideas from human decision-making processes, a hierarchical reinforcement-learning-based jamming resource allocation scheme is established. This scheme designs a hierarchical policy network with a shared evaluation network, achieving joint optimization of hybrid discrete (jamming task) and continuous (jamming power) control variables. The value loss, top-level policy loss, and low-level policy loss, which are constructed based on the total reward, are optimized to update the parameters of the evaluation network and the hierarchical policy network, thereby improving allocation strategies. Moreover, the state features and total reward are suitably designed based on the jamming mission to aid the jammer's strategy exploration. Finally, our approach is compared with State-of-the-Art deep reinforcement learning algorithms and timely optimization methods, demonstrating superior jamming performance and shorter decision-making time under typical parameters considering radar deployment and formation motion.",

author = "Yuedong Wang and Yan Liang and Zengfu Wang",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.",

year = "2025",

doi = "10.1109/TAES.2024.3467041",

language = "英语",

volume = "61",

pages = "2149--2167",

journal = "IEEE Transactions on Aerospace and Electronic Systems",

issn = "0018-9251",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "2",

}

TY - JOUR

T1 - Hierarchical Reinforcement-Learning-Based Joint Allocation of Jamming Task and Power for Countering Networked Radar

AU - Wang, Yuedong

AU - Liang, Yan

AU - Wang, Zengfu

PY - 2025

Y1 - 2025

N2 - The detection fusion and antijamming of a networked radar (NR) create a significant dynamic game between the NR and the jammer, making immediate jamming strategies that maximize the current jamming benefit unsuitable. Therefore, this article proposes a long-term joint optimization problem of jamming tasks and power allocation in the NR antijamming fusion game. Specifically, a jammer is utilized to disrupt the joint detection capability of the NR, which possesses detection fusion and jamming suppression mechanisms. By simulating task decomposition and hierarchical control ideas from human decision-making processes, a hierarchical reinforcement-learning-based jamming resource allocation scheme is established. This scheme designs a hierarchical policy network with a shared evaluation network, achieving joint optimization of hybrid discrete (jamming task) and continuous (jamming power) control variables. The value loss, top-level policy loss, and low-level policy loss, which are constructed based on the total reward, are optimized to update the parameters of the evaluation network and the hierarchical policy network, thereby improving allocation strategies. Moreover, the state features and total reward are suitably designed based on the jamming mission to aid the jammer's strategy exploration. Finally, our approach is compared with State-of-the-Art deep reinforcement learning algorithms and timely optimization methods, demonstrating superior jamming performance and shorter decision-making time under typical parameters considering radar deployment and formation motion.

AB - The detection fusion and antijamming of a networked radar (NR) create a significant dynamic game between the NR and the jammer, making immediate jamming strategies that maximize the current jamming benefit unsuitable. Therefore, this article proposes a long-term joint optimization problem of jamming tasks and power allocation in the NR antijamming fusion game. Specifically, a jammer is utilized to disrupt the joint detection capability of the NR, which possesses detection fusion and jamming suppression mechanisms. By simulating task decomposition and hierarchical control ideas from human decision-making processes, a hierarchical reinforcement-learning-based jamming resource allocation scheme is established. This scheme designs a hierarchical policy network with a shared evaluation network, achieving joint optimization of hybrid discrete (jamming task) and continuous (jamming power) control variables. The value loss, top-level policy loss, and low-level policy loss, which are constructed based on the total reward, are optimized to update the parameters of the evaluation network and the hierarchical policy network, thereby improving allocation strategies. Moreover, the state features and total reward are suitably designed based on the jamming mission to aid the jammer's strategy exploration. Finally, our approach is compared with State-of-the-Art deep reinforcement learning algorithms and timely optimization methods, demonstrating superior jamming performance and shorter decision-making time under typical parameters considering radar deployment and formation motion.

UR - http://www.scopus.com/inward/record.url?scp=85205150588&partnerID=8YFLogxK

U2 - 10.1109/TAES.2024.3467041

DO - 10.1109/TAES.2024.3467041

M3 - 文章

AN - SCOPUS:85205150588

SN - 0018-9251

VL - 61

SP - 2149

EP - 2167

JO - IEEE Transactions on Aerospace and Electronic Systems

JF - IEEE Transactions on Aerospace and Electronic Systems

IS - 2

ER -

Hierarchical Reinforcement-Learning-Based Joint Allocation of Jamming Task and Power for Countering Networked Radar

摘要

访问文件

其它文件与链接

指纹

引用此