H3T: Hierarchical Transferable Transformer with TokenMix for Unsupervised Domain Adaptation

Yihua Ren, Junyu Gao, Yuan Yuan

科研成果: 期刊稿件文章同行评审

摘要

Recent research has been focused on exploring the capabilities of Vision Transformers (ViTs) in Unsupervised Domain Adaptation (UDA). This approach typically involves providing more significant attention to fine-grained common information through patch-level transferable discrimination. However, prematurely assigning narrow-range transferability information at the encoding stage can sparse image information, thereby increasing the difficulty of downstream tasks. Therefore, we propose a Hierarchical Transferable Transformer with TokenMix (H3T), which maintains the allocation of fine-grained transferability at the encoding stage while enhancing the learning strength of image information through feature mixup. To address the challenge of missing sample labels in the target domain within the domain adaptation task, we have specifically designed the TokenMix Module (TMM) for ViTs. This module learns the style information from both domains while alleviating the impact of image sparsity on downstream tasks. Furthermore, to enhance the semantic connections among narrow-range image transfer messages, we propose the Hierarchical Discriminative Module (HDM), which also serves a critical role in encoding discriminative information. Our approach underwent comprehensive experimentation across five datasets of varying sizes, demonstrating its effectiveness. Our code is available at https://github.com/reyihua/H3T.

源语言英语
文章编号125543
期刊Expert Systems with Applications
262
DOI
出版状态已出版 - 1 3月 2025

指纹

探究 'H3T: Hierarchical Transferable Transformer with TokenMix for Unsupervised Domain Adaptation' 的科研主题。它们共同构成独一无二的指纹。

引用此