A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

Recent studies have shown that cooperative multi-agent deep reinforcement learning (c-MADRL) is under the threat of backdoor attacks. Once a backdoor trigger is observed, it will perform abnormal actions leading to failures or malicious goals. However, existing proposed backdoors suffer from several issues, e.g., fixed visual trigger patterns lack stealthiness, the backdoor is trained or activated by an additional network, or all agents are backdoored. To this end, in this paper, we propose a novel backdoor attack against c-MADRL, which attacks the entire multi-agent team by embedding the backdoor only in a single agent. Firstly, we introduce adversary spatiotemporal behavior patterns as the backdoor trigger rather than manual-injected fixed visual patterns or instant status and control the attack duration. This method can guarantee the stealthiness and practicality of injected backdoors. Secondly, we hack the original reward function of the backdoored agent via reward reverse and unilateral guidance during training to ensure its adverse influence on the entire team. We evaluate our backdoor attacks on two classic c-MADRL algorithms VDN and QMIX, in a popular c-MADRL environment SMAC. The experimental results demonstrate that our backdoor attacks are able to reach a high attack success rate (91.6%) while maintaining a low clean performance variance rate (3.7%).

Original languageEnglish
Title of host publicationGLOBECOM 2024 - 2024 IEEE Global Communications Conference
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages4280-4285
Number of pages6
ISBN (Electronic)9798350351255
DOIs
StatePublished - 2024
Event2024 IEEE Global Communications Conference, GLOBECOM 2024 - Cape Town, South Africa
Duration: 8 Dec 202412 Dec 2024

Publication series

NameProceedings - IEEE Global Communications Conference, GLOBECOM
ISSN (Print)2334-0983
ISSN (Electronic)2576-6813

Conference

Conference2024 IEEE Global Communications Conference, GLOBECOM 2024
Country/TerritorySouth Africa
CityCape Town
Period8/12/2412/12/24

Keywords

  • Cooperative multi-agent deep reinforcement learning
  • backdoor attack

Fingerprint

Dive into the research topics of 'A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this