A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning

Yinbo Yu; Saihao Yan; Jiajia Liu

doi:10.1109/GLOBECOM52923.2024.10901370

School of Cybersecurity

Northwestern Polytechnical University, Xi'an

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Peer-reviewed

Abstract

Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-MADRL) is under the threat of backdoor attacks. Once a backdoored agent observes the trigger, it performs abnormal actions that lead to failures or serve malicious goals. However, existing backdoors suffer from several issues: fixed visual trigger patterns lack stealthiness, the backdoor is trained or activated by an additional network, or all agents are backdoored. To this end, we propose a novel backdoor attack against c-MADRL that attacks the entire multi-agent team by embedding the backdoor in only a single agent. First, we use adversary spatiotemporal behavior patterns as the backdoor trigger, rather than manually injected fixed visual patterns or instantaneous status, and control the attack duration. This method can guarantee the stealthiness and practicality of injected backdoors. Second, we hack the original reward function of the backdoored agent via reward reverse and unilateral guidance during training to ensure its adverse influence on the entire team. We evaluate our backdoor attacks on two classic c-MADRL algorithms, VDN and QMIX, in a popular c-MADRL environment, SMAC. The experimental results demonstrate that our backdoor attacks reach a high attack success rate (91.6%) while maintaining a low clean performance variance rate (3.7%).

Original language	English
Title of host publication	GLOBECOM 2024 - 2024 IEEE Global Communications Conference
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	4280-4285
Number of pages	6
ISBN (Electronic)	9798350351255
DOI	https://doi.org/10.1109/GLOBECOM52923.2024.10901370
Publication status	Published - 2024
Event	2024 IEEE Global Communications Conference, GLOBECOM 2024 - Cape Town, South Africa. Duration: 8 Dec 2024 → 12 Dec 2024

Publication series

Name	Proceedings - IEEE Global Communications Conference, GLOBECOM
ISSN (Print)	2334-0983
ISSN (Electronic)	2576-6813

Conference

Conference	2024 IEEE Global Communications Conference, GLOBECOM 2024
Country/Territory	South Africa
City	Cape Town
Period	8 Dec 2024 → 12 Dec 2024

Access to document

10.1109/GLOBECOM52923.2024.10901370

Other files and links

Link to publication in Scopus

Cite this

Yu, Y., Yan, S., & Liu, J. (2024). A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning. In GLOBECOM 2024 - 2024 IEEE Global Communications Conference (pp. 4280-4285). (Proceedings - IEEE Global Communications Conference, GLOBECOM). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/GLOBECOM52923.2024.10901370

@inproceedings{5fe26d7d0af4400db8c250f6b22e7a71,

title = "A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning",

abstract = "Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-MADRL) is under the threat of backdoor attacks. Once a backdoor trigger is observed, it will perform abnormal actions leading to failures or malicious goals. However, existing proposed backdoors suffer from several issues, e.g., fixed visual trigger patterns lack stealthiness, the backdoor is trained or activated by an additional network, or all agents are backdoored. To this end, in this paper, we propose a novel backdoor attack against c-MADRL, which attacks the entire multi-agent team by embedding the backdoor only in a single agent. Firstly, we introduce adversary spatiotemporal behavior patterns as the backdoor trigger rather than manual-injected fixed visual patterns or instant status and control the attack duration. This method can guarantee the stealthiness and practicality of injected backdoors. Secondly, we hack the original reward function of the backdoored agent via reward reverse and unilateral guidance during training to ensure its adverse influence on the entire team. We evaluate our backdoor attacks on two classic c-MADRL algorithms VDN and QMIX, in a popular c-MADRL environment SMAC. The experimental results demonstrate that our backdoor attacks are able to reach a high attack success rate (91.6%) while maintaining a low clean performance variance rate (3.7%).",

keywords = "Cooperative multi-agent deep reinforcement learning, backdoor attack",

author = "Yinbo Yu and Saihao Yan and Jiajia Liu",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE Global Communications Conference, GLOBECOM 2024 ; Conference date: 08-12-2024 Through 12-12-2024",

year = "2024",

doi = "10.1109/GLOBECOM52923.2024.10901370",

language = "English",

series = "Proceedings - IEEE Global Communications Conference, GLOBECOM",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "4280--4285",

booktitle = "GLOBECOM 2024 - 2024 IEEE Global Communications Conference",

}

Yu, Y, Yan, S & Liu, J 2024, A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning. in GLOBECOM 2024 - 2024 IEEE Global Communications Conference. Proceedings - IEEE Global Communications Conference, GLOBECOM, Institute of Electrical and Electronics Engineers Inc., pp. 4280-4285, 2024 IEEE Global Communications Conference, GLOBECOM 2024, Cape Town, South Africa, 8/12/24. https://doi.org/10.1109/GLOBECOM52923.2024.10901370

A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning. / Yu, Yinbo; Yan, Saihao; Liu, Jiajia.
GLOBECOM 2024 - 2024 IEEE Global Communications Conference. Institute of Electrical and Electronics Engineers Inc., 2024. p. 4280-4285 (Proceedings - IEEE Global Communications Conference, GLOBECOM).

TY - GEN

T1 - A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning

AU - Yu, Yinbo

AU - Yan, Saihao

AU - Liu, Jiajia

PY - 2024

Y1 - 2024

AB - Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-MADRL) is under the threat of backdoor attacks. Once a backdoor trigger is observed, it will perform abnormal actions leading to failures or malicious goals. However, existing proposed backdoors suffer from several issues, e.g., fixed visual trigger patterns lack stealthiness, the backdoor is trained or activated by an additional network, or all agents are backdoored. To this end, in this paper, we propose a novel backdoor attack against c-MADRL, which attacks the entire multi-agent team by embedding the backdoor only in a single agent. Firstly, we introduce adversary spatiotemporal behavior patterns as the backdoor trigger rather than manual-injected fixed visual patterns or instant status and control the attack duration. This method can guarantee the stealthiness and practicality of injected backdoors. Secondly, we hack the original reward function of the backdoored agent via reward reverse and unilateral guidance during training to ensure its adverse influence on the entire team. We evaluate our backdoor attacks on two classic c-MADRL algorithms VDN and QMIX, in a popular c-MADRL environment SMAC. The experimental results demonstrate that our backdoor attacks are able to reach a high attack success rate (91.6%) while maintaining a low clean performance variance rate (3.7%).

KW - Cooperative multi-agent deep reinforcement learning

KW - backdoor attack

UR - http://www.scopus.com/inward/record.url?scp=105000822123&partnerID=8YFLogxK

U2 - 10.1109/GLOBECOM52923.2024.10901370

DO - 10.1109/GLOBECOM52923.2024.10901370

M3 - Conference contribution

AN - SCOPUS:105000822123

T3 - Proceedings - IEEE Global Communications Conference, GLOBECOM

SP - 4280

EP - 4285

BT - GLOBECOM 2024 - 2024 IEEE Global Communications Conference

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2024 IEEE Global Communications Conference, GLOBECOM 2024

Y2 - 8 December 2024 through 12 December 2024

ER -

Yu Y, Yan S, Liu J. A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning. In GLOBECOM 2024 - 2024 IEEE Global Communications Conference. Institute of Electrical and Electronics Engineers Inc. 2024. p. 4280-4285. (Proceedings - IEEE Global Communications Conference, GLOBECOM). doi: 10.1109/GLOBECOM52923.2024.10901370
