Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval

Tianshi Wang; Lei Zhu; Zheng Zhang; Huaxiang Zhang; Junwei Han

doi:10.1109/TCSVT.2023.3263054

Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval

Tianshi Wang, Lei Zhu, Zheng Zhang, Huaxiang Zhang, Junwei Han

自动化学院

科研成果: 期刊稿件 › 文章 › 同行评审

34 引用（Scopus）

摘要

Deep cross-modal hashing has achieved excellent retrieval performance with the powerful representation capability of deep neural networks. Regrettably, current methods are inevitably vulnerable to adversarial attacks, especially well-designed subtle perturbations that can easily fool deep cross-modal hashing models into returning irrelevant or the attacker's specified results. Although adversarial attacks have attracted increasing attention, there are few studies on specialized attacks against deep cross-modal hashing. To solve these issues, we propose a targeted adversarial attack method against deep cross-modal hashing retrieval in this paper. To the best of our knowledge, this is the first work in this research field. Concretely, we first build a progressive fusion module to extract fine-grained target semantics through a progressive attention mechanism. Meanwhile, we design a semantic adaptation network to generate the target prototype code and reconstruct the category label, thus realizing the semantic interaction between the target semantics and the implicit semantics of the attacked model. To bridge modality gaps and preserve local example details, a semantic translator seamlessly translates the target semantics and then embeds them into benign examples in collaboration with a U-Net framework. Moreover, we construct a discriminator for adversarial training, which enhances the visual realism and category discrimination of adversarial examples, thus improving their targeted attack performance. Extensive experiments on widely tested cross-modal retrieval datasets demonstrate the superiority of our proposed method. Also, transferable attacks show that our generated adversarial examples have well generalization capability on targeted attacks. The source codes and datasets are available at https://github.com/tswang0116/TA-DCH.

源语言	英语
页（从-至）	6159-6172
页数	14
期刊	IEEE Transactions on Circuits and Systems for Video Technology
卷	33
期	10
DOI	https://doi.org/10.1109/TCSVT.2023.3263054
出版状态	已出版 - 1 10月 2023

访问文件

10.1109/TCSVT.2023.3263054

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{1b9a5bbb95bf4b64bd20eec102e00e6a,

title = "Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval",

abstract = "Deep cross-modal hashing has achieved excellent retrieval performance with the powerful representation capability of deep neural networks. Regrettably, current methods are inevitably vulnerable to adversarial attacks, especially well-designed subtle perturbations that can easily fool deep cross-modal hashing models into returning irrelevant or the attacker's specified results. Although adversarial attacks have attracted increasing attention, there are few studies on specialized attacks against deep cross-modal hashing. To solve these issues, we propose a targeted adversarial attack method against deep cross-modal hashing retrieval in this paper. To the best of our knowledge, this is the first work in this research field. Concretely, we first build a progressive fusion module to extract fine-grained target semantics through a progressive attention mechanism. Meanwhile, we design a semantic adaptation network to generate the target prototype code and reconstruct the category label, thus realizing the semantic interaction between the target semantics and the implicit semantics of the attacked model. To bridge modality gaps and preserve local example details, a semantic translator seamlessly translates the target semantics and then embeds them into benign examples in collaboration with a U-Net framework. Moreover, we construct a discriminator for adversarial training, which enhances the visual realism and category discrimination of adversarial examples, thus improving their targeted attack performance. Extensive experiments on widely tested cross-modal retrieval datasets demonstrate the superiority of our proposed method. Also, transferable attacks show that our generated adversarial examples have well generalization capability on targeted attacks. The source codes and datasets are available at https://github.com/tswang0116/TA-DCH.",

keywords = "Targeted adversarial attack, adversarial generation, cross-modal prototype learning, deep cross-modal hashing retrieval",

author = "Tianshi Wang and Lei Zhu and Zheng Zhang and Huaxiang Zhang and Junwei Han",

note = "Publisher Copyright: {\textcopyright} 2023 IEEE.",

year = "2023",

month = oct,

day = "1",

doi = "10.1109/TCSVT.2023.3263054",

language = "英语",

volume = "33",

pages = "6159--6172",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "10",

}

TY - JOUR

T1 - Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval

AU - Wang, Tianshi

AU - Zhu, Lei

AU - Zhang, Zheng

AU - Zhang, Huaxiang

AU - Han, Junwei

PY - 2023/10/1

Y1 - 2023/10/1

N2 - Deep cross-modal hashing has achieved excellent retrieval performance with the powerful representation capability of deep neural networks. Regrettably, current methods are inevitably vulnerable to adversarial attacks, especially well-designed subtle perturbations that can easily fool deep cross-modal hashing models into returning irrelevant or the attacker's specified results. Although adversarial attacks have attracted increasing attention, there are few studies on specialized attacks against deep cross-modal hashing. To solve these issues, we propose a targeted adversarial attack method against deep cross-modal hashing retrieval in this paper. To the best of our knowledge, this is the first work in this research field. Concretely, we first build a progressive fusion module to extract fine-grained target semantics through a progressive attention mechanism. Meanwhile, we design a semantic adaptation network to generate the target prototype code and reconstruct the category label, thus realizing the semantic interaction between the target semantics and the implicit semantics of the attacked model. To bridge modality gaps and preserve local example details, a semantic translator seamlessly translates the target semantics and then embeds them into benign examples in collaboration with a U-Net framework. Moreover, we construct a discriminator for adversarial training, which enhances the visual realism and category discrimination of adversarial examples, thus improving their targeted attack performance. Extensive experiments on widely tested cross-modal retrieval datasets demonstrate the superiority of our proposed method. Also, transferable attacks show that our generated adversarial examples have well generalization capability on targeted attacks. The source codes and datasets are available at https://github.com/tswang0116/TA-DCH.

AB - Deep cross-modal hashing has achieved excellent retrieval performance with the powerful representation capability of deep neural networks. Regrettably, current methods are inevitably vulnerable to adversarial attacks, especially well-designed subtle perturbations that can easily fool deep cross-modal hashing models into returning irrelevant or the attacker's specified results. Although adversarial attacks have attracted increasing attention, there are few studies on specialized attacks against deep cross-modal hashing. To solve these issues, we propose a targeted adversarial attack method against deep cross-modal hashing retrieval in this paper. To the best of our knowledge, this is the first work in this research field. Concretely, we first build a progressive fusion module to extract fine-grained target semantics through a progressive attention mechanism. Meanwhile, we design a semantic adaptation network to generate the target prototype code and reconstruct the category label, thus realizing the semantic interaction between the target semantics and the implicit semantics of the attacked model. To bridge modality gaps and preserve local example details, a semantic translator seamlessly translates the target semantics and then embeds them into benign examples in collaboration with a U-Net framework. Moreover, we construct a discriminator for adversarial training, which enhances the visual realism and category discrimination of adversarial examples, thus improving their targeted attack performance. Extensive experiments on widely tested cross-modal retrieval datasets demonstrate the superiority of our proposed method. Also, transferable attacks show that our generated adversarial examples have well generalization capability on targeted attacks. The source codes and datasets are available at https://github.com/tswang0116/TA-DCH.

KW - Targeted adversarial attack

KW - adversarial generation

KW - cross-modal prototype learning

KW - deep cross-modal hashing retrieval

UR - http://www.scopus.com/inward/record.url?scp=85153333014&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2023.3263054

DO - 10.1109/TCSVT.2023.3263054

M3 - 文章

AN - SCOPUS:85153333014

SN - 1051-8215

VL - 33

SP - 6159

EP - 6172

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 10

ER -

Targeted Adversarial Attack Against Deep Cross-Modal Hashing Retrieval

摘要

访问文件

其它文件与链接

指纹

引用此