TY - JOUR
T1 - CGN: Class gradient network for the construction of adversarial samples
AU - Li, Xiang
AU - Guo, Haiwang
AU - Deng, Xinyang
AU - Jiang, Wen
N1 - Publisher Copyright:
© 2023 Elsevier Inc.
PY - 2024/1
Y1 - 2024/1
N2 - Deep neural networks (DNNs) have achieved tremendous success in many computer vision fields. Nevertheless, previous research has demonstrated that DNNs are vulnerable to adversarial sample attacks: attackers add carefully designed perturbation noise to clean samples to form adversarial samples, which can cause DNNs to make incorrect predictions. Consequently, the security of deep learning has attracted considerable attention, and researchers have begun exploring adversarial samples from different perspectives. In this paper, a method based on a class gradient network (CGN) is proposed that generates high-quality adversarial samples by designing multiple objective functions. Specifically, a high-level class gradient matrix is introduced to guide changes in the adversarial sample's high-level features, and a classification loss and a perturbation loss are combined to jointly train a generator that fits the distribution of adversarial noise. We conducted experiments on two standard datasets, Fashion-MNIST and CIFAR-10. The results demonstrate the superior transferability of our adversarial samples in targeted attacks and show that the approach outperforms the baseline method.
AB - Deep neural networks (DNNs) have achieved tremendous success in many computer vision fields. Nevertheless, previous research has demonstrated that DNNs are vulnerable to adversarial sample attacks: attackers add carefully designed perturbation noise to clean samples to form adversarial samples, which can cause DNNs to make incorrect predictions. Consequently, the security of deep learning has attracted considerable attention, and researchers have begun exploring adversarial samples from different perspectives. In this paper, a method based on a class gradient network (CGN) is proposed that generates high-quality adversarial samples by designing multiple objective functions. Specifically, a high-level class gradient matrix is introduced to guide changes in the adversarial sample's high-level features, and a classification loss and a perturbation loss are combined to jointly train a generator that fits the distribution of adversarial noise. We conducted experiments on two standard datasets, Fashion-MNIST and CIFAR-10. The results demonstrate the superior transferability of our adversarial samples in targeted attacks and show that the approach outperforms the baseline method.
KW - Adversarial samples
KW - Class gradient matrix
KW - Generator
KW - Transferability
UR - http://www.scopus.com/inward/record.url?scp=85176277762&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2023.119855
DO - 10.1016/j.ins.2023.119855
M3 - Article
AN - SCOPUS:85176277762
SN - 0020-0255
VL - 654
JO - Information Sciences
JF - Information Sciences
M1 - 119855
ER -