Generative attack in complex real-world scenarios

Hongyu Peng; Gong Cheng; Xuxiang Sun

doi:10.1016/j.patcog.2025.111893

School of Automation

Northwestern Polytechnical University Xian

Research output: Contribution to journal › Article › peer-review

Abstract

Existing generative attacks primarily learn from single-object scenarios, and they might fail to handle the intricate spatial and semantic relationships between multiple objects, which are common in real-world scenarios with dense occlusions and other complexities. To address these limitations, we propose Generative Attack in Complex Real-world Scenarios (GACRS), a novel method designed to enhance the transferability of adversarial examples. First, our analysis indicates that the existing method for utilizing the CLIP text branch is limited, mainly due to its random sampling strategy, which introduces sampling bias and restricts it to scenarios with only two categories within a single scene. Thus, we propose a multi-object clustering-based text sampling method tailored for the CLIP text branch, thereby enhancing the diversity and relevance of text features and providing more meaningful guidance for generator optimization. In addition, to the best of our knowledge, we are the first to apply curriculum learning to the training process of generative attacks. This operation involves a dynamic input sample selection strategy that adapts to different training phases, enabling the generator to transition from simpler tasks to more complex tasks, thereby improving the generalization capability of adversarial perturbations. Extensive experiments across within-domain, cross-domain, and cross-task scenarios show that GACRS consistently outperforms existing peer methods. Code will be released at https://github.com/phyyyy/GACRS.

Original language	English
Article number	111893
Journal	Pattern Recognition
Volume	169
DOIs	https://doi.org/10.1016/j.patcog.2025.111893
State	Published - Jan 2026

Keywords

Clustering
Computer vision
Curriculum learning
Generative attack
Pattern recognition

Access to Document

10.1016/j.patcog.2025.111893

Cite this

@article{afcd7eac97314f7ca43cc9b2dd249f68,

title = "Generative attack in complex real-world scenarios",

abstract = "Existing generative attacks primarily learn from single-object scenarios, and they might fail to handle the intricate spatial and semantic relationships between multiple objects, which are common in real-world scenarios with dense occlusions and other complexities. To address these limitations, we propose Generative Attack in Complex Real-world Scenarios (GACRS), a novel method designed to enhance the transferability of adversarial examples. First, our analysis indicates that the existing method for utilizing the CLIP text branch is limited, mainly due to its random sampling strategy, which introduces sampling bias and restricts it to scenarios with only two categories within a single scene. Thus, we propose a multi-object clustering-based text sampling method tailored for the CLIP text branch, thereby enhancing the diversity and relevance of text features and providing more meaningful guidance for generator optimization. In addition, to the best of our knowledge, we are the first to apply curriculum learning to the training process of generative attacks. This operation involves a dynamic input sample selection strategy that adapts to different training phases, enabling the generator to transition from simpler tasks to more complex tasks, thereby improving the generalization capability of adversarial perturbations. Extensive experiments across within-domain, cross-domain, and cross-task scenarios show that GACRS consistently outperforms existing peer methods. Code will be released at https://github.com/phyyyy/GACRS.",

keywords = "Clustering, Computer vision, Curriculum learning, Generative attack, Pattern recognition",

author = "Hongyu Peng and Gong Cheng and Xuxiang Sun",

note = "Publisher Copyright: {\textcopyright} 2025 Elsevier Ltd",

year = "2026",

month = jan,

doi = "10.1016/j.patcog.2025.111893",

language = "English",

volume = "169",

journal = "Pattern Recognition",

issn = "0031-3203",

publisher = "Elsevier Ltd",

}

TY - JOUR

T1 - Generative attack in complex real-world scenarios

AU - Peng, Hongyu

AU - Cheng, Gong

AU - Sun, Xuxiang

PY - 2026/1

Y1 - 2026/1

AB - Existing generative attacks primarily learn from single-object scenarios, and they might fail to handle the intricate spatial and semantic relationships between multiple objects, which are common in real-world scenarios with dense occlusions and other complexities. To address these limitations, we propose Generative Attack in Complex Real-world Scenarios (GACRS), a novel method designed to enhance the transferability of adversarial examples. First, our analysis indicates that the existing method for utilizing the CLIP text branch is limited, mainly due to its random sampling strategy, which introduces sampling bias and restricts it to scenarios with only two categories within a single scene. Thus, we propose a multi-object clustering-based text sampling method tailored for the CLIP text branch, thereby enhancing the diversity and relevance of text features and providing more meaningful guidance for generator optimization. In addition, to the best of our knowledge, we are the first to apply curriculum learning to the training process of generative attacks. This operation involves a dynamic input sample selection strategy that adapts to different training phases, enabling the generator to transition from simpler tasks to more complex tasks, thereby improving the generalization capability of adversarial perturbations. Extensive experiments across within-domain, cross-domain, and cross-task scenarios show that GACRS consistently outperforms existing peer methods. Code will be released at https://github.com/phyyyy/GACRS.

KW - Clustering

KW - Computer vision

KW - Curriculum learning

KW - Generative attack

KW - Pattern recognition

UR - http://www.scopus.com/inward/record.url?scp=105007853602&partnerID=8YFLogxK

U2 - 10.1016/j.patcog.2025.111893

DO - 10.1016/j.patcog.2025.111893

M3 - Article

AN - SCOPUS:105007853602

SN - 0031-3203

VL - 169

JO - Pattern Recognition

JF - Pattern Recognition

M1 - 111893

ER -
