Attentional Feature Erase: Towards task-wise transferable adversarial attack on cloud vision APIs

Bo Cheng; Yantao Lu; Yilan Li; Tao You; Peng Zhang

doi:10.1016/j.displa.2023.102634

Attentional Feature Erase: Towards task-wise transferable adversarial attack on cloud vision APIs

Bo Cheng, Yantao Lu, Yilan Li, Tao You, Peng Zhang

School of Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

Recent works have shown that adversarial examples (AEs) can attack and successfully transfer across various neural networks, highlighting the potential danger they pose. However, current approaches that focus on task-specific loss functions may not be as effective across different tasks. Additionally, the use of cloud APIs in practice, which often involve combining multiple tasks, also weakens the effectiveness of existing attacks. To address these issues, we propose a method called Attentional Feature Erase, which is a task-agnostic attack with improved cross-task transferability and effectiveness on computer vision-based cloud APIs. We view the transferability of AEs as a latent contribution for each layer of deep neural networks. By focusing on the intermediate layers of model backbones and reducing high-value features in each intermediate feature map, we are able to maximize the attack performance. Additionally, to better aggregate the gradients and generate adversarial perturbations during backward propagation, Transferability Regularizer is proposed to calculate the attention heatmap for each intermediate feature map and systematically combine the gradients. Comprehensive set of experiments on the Google Cloud Vision APIs and public available datasets (i.e. ImageNet, PASCAL VOC and MS COCO) show that the proposed AFE attack is more effective and has better transferability compared to the state-of-the-art baselines.

Original language	English
Article number	102634
Journal	Displays
Volume	82
DOIs	https://doi.org/10.1016/j.displa.2023.102634
State	Published - Apr 2024

Keywords

Adversarial example
Black-box attack
Cloud API attack

Access to Document

10.1016/j.displa.2023.102634

Cite this

@article{21b5333afcf64985bcb7d419bb419f7a,

title = "Attentional Feature Erase: Towards task-wise transferable adversarial attack on cloud vision APIs",

abstract = "Recent works have shown that adversarial examples (AEs) can attack and successfully transfer across various neural networks, highlighting the potential danger they pose. However, current approaches that focus on task-specific loss functions may not be as effective across different tasks. Additionally, the use of cloud APIs in practice, which often involve combining multiple tasks, also weakens the effectiveness of existing attacks. To address these issues, we propose a method called Attentional Feature Erase, which is a task-agnostic attack with improved cross-task transferability and effectiveness on computer vision-based cloud APIs. We view the transferability of AEs as a latent contribution for each layer of deep neural networks. By focusing on the intermediate layers of model backbones and reducing high-value features in each intermediate feature map, we are able to maximize the attack performance. Additionally, to better aggregate the gradients and generate adversarial perturbations during backward propagation, Transferability Regularizer is proposed to calculate the attention heatmap for each intermediate feature map and systematically combine the gradients. Comprehensive set of experiments on the Google Cloud Vision APIs and public available datasets (i.e. ImageNet, PASCAL VOC and MS COCO) show that the proposed AFE attack is more effective and has better transferability compared to the state-of-the-art baselines.",

keywords = "Adversarial example, Black-box attack, Cloud API attack",

author = "Bo Cheng and Yantao Lu and Yilan Li and Tao You and Peng Zhang",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier B.V.",

year = "2024",

month = apr,

doi = "10.1016/j.displa.2023.102634",

language = "英语",

volume = "82",

journal = "Displays",

issn = "0141-9382",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Attentional Feature Erase

T2 - Towards task-wise transferable adversarial attack on cloud vision APIs

AU - Cheng, Bo

AU - Lu, Yantao

AU - Li, Yilan

AU - You, Tao

AU - Zhang, Peng

PY - 2024/4

Y1 - 2024/4

N2 - Recent works have shown that adversarial examples (AEs) can attack and successfully transfer across various neural networks, highlighting the potential danger they pose. However, current approaches that focus on task-specific loss functions may not be as effective across different tasks. Additionally, the use of cloud APIs in practice, which often involve combining multiple tasks, also weakens the effectiveness of existing attacks. To address these issues, we propose a method called Attentional Feature Erase, which is a task-agnostic attack with improved cross-task transferability and effectiveness on computer vision-based cloud APIs. We view the transferability of AEs as a latent contribution for each layer of deep neural networks. By focusing on the intermediate layers of model backbones and reducing high-value features in each intermediate feature map, we are able to maximize the attack performance. Additionally, to better aggregate the gradients and generate adversarial perturbations during backward propagation, Transferability Regularizer is proposed to calculate the attention heatmap for each intermediate feature map and systematically combine the gradients. Comprehensive set of experiments on the Google Cloud Vision APIs and public available datasets (i.e. ImageNet, PASCAL VOC and MS COCO) show that the proposed AFE attack is more effective and has better transferability compared to the state-of-the-art baselines.

AB - Recent works have shown that adversarial examples (AEs) can attack and successfully transfer across various neural networks, highlighting the potential danger they pose. However, current approaches that focus on task-specific loss functions may not be as effective across different tasks. Additionally, the use of cloud APIs in practice, which often involve combining multiple tasks, also weakens the effectiveness of existing attacks. To address these issues, we propose a method called Attentional Feature Erase, which is a task-agnostic attack with improved cross-task transferability and effectiveness on computer vision-based cloud APIs. We view the transferability of AEs as a latent contribution for each layer of deep neural networks. By focusing on the intermediate layers of model backbones and reducing high-value features in each intermediate feature map, we are able to maximize the attack performance. Additionally, to better aggregate the gradients and generate adversarial perturbations during backward propagation, Transferability Regularizer is proposed to calculate the attention heatmap for each intermediate feature map and systematically combine the gradients. Comprehensive set of experiments on the Google Cloud Vision APIs and public available datasets (i.e. ImageNet, PASCAL VOC and MS COCO) show that the proposed AFE attack is more effective and has better transferability compared to the state-of-the-art baselines.

KW - Adversarial example

KW - Black-box attack

KW - Cloud API attack

UR - http://www.scopus.com/inward/record.url?scp=85183386795&partnerID=8YFLogxK

U2 - 10.1016/j.displa.2023.102634

DO - 10.1016/j.displa.2023.102634

M3 - 文章

AN - SCOPUS:85183386795

SN - 0141-9382

VL - 82

JO - Displays

JF - Displays

M1 - 102634

ER -

Attentional Feature Erase: Towards task-wise transferable adversarial attack on cloud vision APIs

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this