跳到主要导航 跳到搜索 跳到主要内容

Unveiling Backdoor Propagation in Graphs: Neuron-Centric Defense Mechanisms

  • Di Jin
  • , Bingdao Feng
  • , Xiaobao Wang
  • , Yuxiang Zhang
  • , Zechuan Zhang
  • , Liang Yang
  • , Dongxiao He
  • , Zhen Wang
  • Tianjin University
  • Hebei University of Technology

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Defending against backdoor attacks on graphs has become increasingly critical. Existing methods predominantly focus on detecting and removing triggers by identifying inconsistencies between trigger and clean nodes. However, adversaries can design triggers that closely resemble clean nodes, making them challenging to detect. Therefore, understanding the mechanisms underlying backdoor attacks is crucial. In this work, we observe an interesting phenomenon: in backdoored models, specific "backdoor neurons"(embedding dimensions) are more likely to be activated, causing nodes to be misclassified to the target label. This is largely due to the graph structure, where malicious information propagates through node neighborhoods, activating specific neurons and target label. Based on this observation, we theoretically and empirically demonstrate how graph backdoor attacks exploit this propagation mechanism to effectively poison the target node's embedding. Meanwhile, we propose a novel defense called Graph Backdoor Neuron Defense (GBND) to identify, unlearn, and recover backdoor neurons. Specifically, we design a novel reverse engineering technique to identify triggers that activate backdoor neurons, and eliminate their harmful effects by asymmetric unlearning and recovering at the neuron level. Extensive experiments on four datasets validate the effectiveness of GBND in defending against backdoor attacks.

源语言英语
主期刊名WWW 2026 - Proceedings of the ACM Web Conference 2026
出版商Association for Computing Machinery, Inc
1038-1048
页数11
ISBN(电子版)9798400723070
DOI
出版状态已出版 - 12 4月 2026
活动35th ACM Web Conference, WWW 2026 - Dubai, 阿拉伯联合酋长国
期限: 29 6月 20263 7月 2026

出版系列

姓名WWW 2026 - Proceedings of the ACM Web Conference 2026

会议

会议35th ACM Web Conference, WWW 2026
国家/地区阿拉伯联合酋长国
Dubai
时期29/06/263/07/26

指纹

探究 'Unveiling Backdoor Propagation in Graphs: Neuron-Centric Defense Mechanisms' 的科研主题。它们共同构成独一无二的指纹。

引用此