STDatav2: Accessing Efficient Black-Box Stealing for Adversarial Attacks

Xuxiang Sun, Gong Cheng, Hongda Li, Chunbo Lang, Junwei Han

Research output: Contribution to journal › Article › peer-review

Abstract

Owing to its extreme setting, stealing a black-box model without access to its training data is difficult in practice. On this topic, along the lines of data diversity, this paper makes the following substantial improvements over our conference version (dubbed STDatav1, short for Surrogate Training Data). First, to mitigate the undesirable impact of potential mode collapse while training the generator, we propose a joint-data optimization scheme that utilizes both synthesized data and proxy data to optimize the surrogate model. Second, we propose a self-conditional data synthesis framework, which builds a pseudo-class mapping via grouped class-information extraction to preserve class-specific constraints while maintaining diversity. Within this new framework, we inherit and integrate the class-specific constraints of STDatav1 and design a dual cross-entropy loss tailored to it. Finally, to enable comprehensive evaluation, we conduct experiments on four commonly adopted datasets with a total of eight kinds of models. These evaluations show considerable performance gains over our earlier work and demonstrate the competitive performance and promising potential of our approach.
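The abstract does not give the exact formulation of the joint-data optimization scheme, but its core idea — training the surrogate on both generator-synthesized data (with soft labels obtained by querying the black box) and real proxy data — can be sketched as a weighted sum of cross-entropy terms. The weighting factor `alpha`, the function names, and the linear-softmax surrogate below are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def softmax(z):
    # numerically stable row-wise softmax
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(probs, targets):
    # mean cross-entropy against (possibly soft) target distributions
    return float(-np.mean(np.sum(targets * np.log(probs + 1e-12), axis=1)))

def joint_loss(W, x_syn, y_syn, x_proxy, y_proxy, alpha=0.5):
    # Hypothetical joint-data objective: the surrogate (here a toy
    # linear-softmax classifier W) is fit on a synthesized batch
    # (y_syn = black-box soft outputs) and a proxy batch (y_proxy =
    # proxy-data labels), combined with an assumed weight alpha.
    loss_syn = cross_entropy(softmax(x_syn @ W), y_syn)
    loss_proxy = cross_entropy(softmax(x_proxy @ W), y_proxy)
    return alpha * loss_syn + (1.0 - alpha) * loss_proxy
```

With `alpha=1.0` the objective reduces to pure synthesized-data distillation (roughly the STDatav1 setting as described), while intermediate values mix in the proxy data that, per the abstract, counteracts mode collapse in the generator's outputs.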

Original language: English
Pages (from-to): 2429-2445
Number of pages: 17
Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume: 47
Issue number: 4
State: Published - 2025

Keywords

  • Black-box attacks
  • joint-data optimization
  • model stealing
  • self-conditional data synthesis
  • surrogate training data (STData)
