An Enhanced Lagrangian Index Policy for Beam Scheduling of Colocated MIMO Radars

Min Yang; Zengfu Wang; Jing Fu; Xiaoxu Wang; José Niño-Mora

doi:10.1109/TAES.2025.3526910

School of Automation

Research output: Contribution to journal › Article › peer-review

Abstract

Optimal multi-target tracking in colocated multiple-input multiple-output radar systems with limited beam resources has been widely studied over the past decades but, in general, remains an open question due to its high complexity. Here, we aim to minimize a measure of the overall error covariance of target kinematic state estimation by appropriately allocating the beam resources to different targets. We model the beam scheduling problem as a restless multi-armed bandit problem that aims to minimize the expected total discounted cost over an infinite time horizon and is in general PSPACE-hard. We improve upon the Whittle relaxation technique by proposing a more stringent method to decompose the correlated restless bandit processes. This leads to a relaxed version of the original optimization problem with a tighter performance bound than the Whittle relaxation. Meanwhile, unlike the Lagrangian dynamic program that attaches an independent Lagrangian multiplier to each decision epoch, which is inapplicable for infinite-horizon objectives, our method trades off the number of Lagrangian multipliers against the tightness of the relaxation. The proposed method allows different relaxation levels to be exploited and results in a more efficient and effective policy. Numerical experiments demonstrate the effectiveness of the proposed policy.
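For readers unfamiliar with the baseline the abstract refers to, the standard Whittle relaxation of a discounted restless bandit problem can be sketched as follows. The notation is assumed for illustration and is not taken from the paper: N targets (arms), M beams, discount factor β, state s_n(t), binary action a_n(t), per-step cost c_n, fixed initial states, and an equality beam budget. The paper's own, more stringent decomposition is not reproduced here.

```latex
% Assumed notation (not from the paper): N targets (arms), M beams,
% discount factor \beta \in (0,1), state s_n(t), action a_n(t) \in \{0,1\},
% per-step cost c_n(s,a), fixed initial states.

% Original problem: the beam budget must hold at every decision epoch.
\begin{equation}
V^{*} = \min_{\pi}\; \mathbb{E}^{\pi}\!\left[\sum_{t=0}^{\infty} \beta^{t}
        \sum_{n=1}^{N} c_n\bigl(s_n(t), a_n(t)\bigr)\right]
\quad \text{s.t.} \quad \sum_{n=1}^{N} a_n(t) = M \quad \forall t \ge 0 .
\end{equation}

% Whittle relaxation: the budget only has to hold in expected discounted total.
\begin{equation}
\mathbb{E}^{\pi}\!\left[\sum_{t=0}^{\infty} \beta^{t} \sum_{n=1}^{N} a_n(t)\right]
  = \frac{M}{1-\beta} .
\end{equation}

% Dualizing that single constraint with one multiplier \nu decouples the arms
% into N single-arm problems and yields a lower bound on V^{*}.
\begin{equation}
L(\nu) = \sum_{n=1}^{N} \min_{\pi_n}\; \mathbb{E}^{\pi_n}\!\left[\sum_{t=0}^{\infty}
         \beta^{t} \Bigl(c_n\bigl(s_n(t), a_n(t)\bigr) + \nu\, a_n(t)\Bigr)\right]
         - \frac{\nu M}{1-\beta},
\qquad L(\nu) \le V^{*} \ \ \text{for all } \nu \in \mathbb{R} .
\end{equation}
```

In this sketch a single multiplier covers the whole horizon, whereas the Lagrangian dynamic program mentioned in the abstract uses one multiplier per decision epoch. The abstract places the proposed method between these two extremes.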

Original language	English
Journal	IEEE Transactions on Aerospace and Electronic Systems
DOI	https://doi.org/10.1109/TAES.2025.3526910
Publication status	Accepted/In press - 2025

Cite this

@article{7d89db9a10ae47ce94bd9f032ba01aa9,

title = "An Enhanced Lagrangian Index Policy for Beam Scheduling of Colocated MIMO Radars",

abstract = "Optimal multi-target tracking in the colocated multiple-input multiple-output radar system with limited beam resources has been widely considered in the past decades but, in general, still remains an open question due to its high complexity. Here, we aim to minimize a measure of the overall error covariance of target kinematic state estimation by appropriately allocating the beam resources to different targets. We model the beam scheduling problem as a restless multi-armed bandit problem that aims to minimize the expected total discounted cost over an infinite time horizon and is in general PSPACE-hard. We improve upon the Whittle relaxation technique by proposing a more stringent method to decompose the correlated restless bandit processes. It leads to a relaxed version of the original optimization problem with a tighter performance bound compared to the Whittle relaxation. Meanwhile, unlike the Lagrangian dynamic program that attaches an independent Lagrangian multiplier to each decision epoch, which is inapplicable for infinite-horizon objectives, our method trades off the number of Lagrangian multipliers against the tightness of the relaxation. The proposed method allows to exploit different relaxation levels and results in a more efficient and effective policy. Numerical experiments demonstrate the effectiveness of the proposed policy.",

keywords = "Lagrangian dynamic programming, Restless multi-armed bandits, sensor scheduling, target tracking, Whittle relaxation",

author = "Min Yang and Zengfu Wang and Jing Fu and Xiaoxu Wang and Jos{\'e} Ni{\~n}o-Mora",

note = "Publisher Copyright: {\textcopyright} 1965-2011 IEEE.",

year = "2025",

doi = "10.1109/TAES.2025.3526910",

language = "English",

journal = "IEEE Transactions on Aerospace and Electronic Systems",

issn = "0018-9251",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - An Enhanced Lagrangian Index Policy for Beam Scheduling of Colocated MIMO Radars

AU - Yang, Min

AU - Wang, Zengfu

AU - Fu, Jing

AU - Wang, Xiaoxu

AU - Niño-Mora, José

PY - 2025

Y1 - 2025

N2 - Optimal multi-target tracking in the colocated multiple-input multiple-output radar system with limited beam resources has been widely considered in the past decades but, in general, still remains an open question due to its high complexity. Here, we aim to minimize a measure of the overall error covariance of target kinematic state estimation by appropriately allocating the beam resources to different targets. We model the beam scheduling problem as a restless multi-armed bandit problem that aims to minimize the expected total discounted cost over an infinite time horizon and is in general PSPACE-hard. We improve upon the Whittle relaxation technique by proposing a more stringent method to decompose the correlated restless bandit processes. It leads to a relaxed version of the original optimization problem with a tighter performance bound compared to the Whittle relaxation. Meanwhile, unlike the Lagrangian dynamic program that attaches an independent Lagrangian multiplier to each decision epoch, which is inapplicable for infinite-horizon objectives, our method trades off the number of Lagrangian multipliers against the tightness of the relaxation. The proposed method allows to exploit different relaxation levels and results in a more efficient and effective policy. Numerical experiments demonstrate the effectiveness of the proposed policy.

AB - Optimal multi-target tracking in the colocated multiple-input multiple-output radar system with limited beam resources has been widely considered in the past decades but, in general, still remains an open question due to its high complexity. Here, we aim to minimize a measure of the overall error covariance of target kinematic state estimation by appropriately allocating the beam resources to different targets. We model the beam scheduling problem as a restless multi-armed bandit problem that aims to minimize the expected total discounted cost over an infinite time horizon and is in general PSPACE-hard. We improve upon the Whittle relaxation technique by proposing a more stringent method to decompose the correlated restless bandit processes. It leads to a relaxed version of the original optimization problem with a tighter performance bound compared to the Whittle relaxation. Meanwhile, unlike the Lagrangian dynamic program that attaches an independent Lagrangian multiplier to each decision epoch, which is inapplicable for infinite-horizon objectives, our method trades off the number of Lagrangian multipliers against the tightness of the relaxation. The proposed method allows to exploit different relaxation levels and results in a more efficient and effective policy. Numerical experiments demonstrate the effectiveness of the proposed policy.

KW - Lagrangian dynamic programming

KW - Restless multi-armed bandits

KW - sensor scheduling

KW - target tracking

KW - Whittle relaxation

UR - http://www.scopus.com/inward/record.url?scp=85214665830&partnerID=8YFLogxK

U2 - 10.1109/TAES.2025.3526910

DO - 10.1109/TAES.2025.3526910

M3 - Article

AN - SCOPUS:85214665830

SN - 0018-9251

JO - IEEE Transactions on Aerospace and Electronic Systems

JF - IEEE Transactions on Aerospace and Electronic Systems

ER -
