Domain adaptation with temporal ensembling to local attention region search for object detection

Haobin Shi; Ziming He; Kao Shing Hwang

doi:10.1016/j.knosys.2024.112846

School of Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

Object detection relies heavily on supervised learning, which requires labeled training data. However, manual labeling often cannot keep pace with data collection, and models trained on one dataset may not generalize well to new datasets with different characteristics, which leads to domain shift. Domain adaptation addresses this problem by leveraging labeled data from a source domain and unlabeled data from a target domain to improve performance on the target domain. Existing domain adaptation architectures still leave considerable room for improvement in target-domain detection accuracy, and a global search over feature maps is computationally expensive. These problems make it difficult to apply domain adaptive object detection directly to tasks such as medical imaging. To this end, this article proposes two architectures: Region-based Object Detection with Domain Adaptation and Temporal Ensembling (DATE) and the Local Attention Region Search Algorithm (LARSA). DATE combines domain adaptation and temporal ensembling to enhance feature alignment between domains, while LARSA employs an attention mechanism to search efficiently for regions of interest and to decide when to terminate the search early. Experiments on various datasets demonstrate that the proposed approaches improve object detection performance under domain shift and reduce computational cost. The proposed framework has the potential to further promote the application of object detection in medical imaging.
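
The abstract does not spell out how temporal ensembling is used inside DATE, so the following is only a minimal sketch of the general temporal ensembling idea: an exponential moving average of per-sample predictions across epochs, with startup bias correction, serves as a consistency target for unlabeled samples. The function names, the `alpha` momentum value, and the use of plain Python lists are assumptions made for illustration; this is not the authors' implementation.

```python
def update_ensemble(ensemble, predictions, alpha, epoch):
    """One generic temporal-ensembling step on plain lists (illustrative only).

    ensemble:    running average from earlier epochs (all zeros before epoch 1)
    predictions: current-epoch model outputs for the same samples
    alpha:       momentum in [0, 1); an assumed hyperparameter
    epoch:       1-based epoch index, used for startup bias correction
    """
    # Exponential moving average of the predictions seen so far.
    new_ensemble = [alpha * avg + (1.0 - alpha) * pred
                    for avg, pred in zip(ensemble, predictions)]
    # The average starts at zero, so divide by (1 - alpha^epoch) to undo that bias.
    correction = 1.0 - alpha ** epoch
    targets = [avg / correction for avg in new_ensemble]
    return new_ensemble, targets


def consistency_loss(predictions, targets):
    """Mean squared error between current predictions and ensembled targets."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(predictions)


# Hypothetical outputs for two unlabeled target-domain samples over two epochs.
ensemble = [0.0, 0.0]
ensemble, targets_1 = update_ensemble(ensemble, [0.8, 0.2], alpha=0.6, epoch=1)
ensemble, targets_2 = update_ensemble(ensemble, [0.6, 0.4], alpha=0.6, epoch=2)
loss_2 = consistency_loss([0.6, 0.4], targets_2)
```

In the first epoch the bias-corrected targets equal the predictions themselves, so the consistency term only starts to matter once predictions change between epochs.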

Original language	English
Article number	112846
Journal	Knowledge-Based Systems
Volume	309
DOIs	https://doi.org/10.1016/j.knosys.2024.112846
State	Published - 30 Jan 2025

Keywords

Attention mechanism
Domain adaptation
Medical imaging
Object detection
Reinforcement learning
Temporal ensembling

Access to Document

10.1016/j.knosys.2024.112846

Cite this

@article{21ca018d590f42ba9a1724a1c8971608,

title = "Domain adaptation with temporal ensembling to local attention region search for object detection",

abstract = "Object detection relies heavily on supervised learning, which requires labeled training data. However, manual labeling often cannot keep pace with data collection, and models trained on one dataset may not generalize well to new datasets with different characteristics, which leads to domain shift. Domain adaptation addresses this problem by leveraging labeled data from a source domain and unlabeled data from a target domain to improve performance on the target domain. Existing domain adaptation architectures still leave considerable room for improvement in target-domain detection accuracy, and a global search over feature maps is computationally expensive. These problems make it difficult to apply domain adaptive object detection directly to tasks such as medical imaging. To this end, this article proposes two architectures: Region-based Object Detection with Domain Adaptation and Temporal Ensembling (DATE) and the Local Attention Region Search Algorithm (LARSA). DATE combines domain adaptation and temporal ensembling to enhance feature alignment between domains, while LARSA employs an attention mechanism to search efficiently for regions of interest and to decide when to terminate the search early. Experiments on various datasets demonstrate that the proposed approaches improve object detection performance under domain shift and reduce computational cost. The proposed framework has the potential to further promote the application of object detection in medical imaging.",

keywords = "Attention mechanism, Domain adaptation, Medical imaging, Object detection, Reinforcement learning, Temporal ensembling",

author = "Haobin Shi and Ziming He and Hwang, {Kao Shing}",

note = "Publisher Copyright: {\textcopyright} 2024 Elsevier B.V.",

year = "2025",

month = jan,

day = "30",

doi = "10.1016/j.knosys.2024.112846",

language = "English",

volume = "309",

journal = "Knowledge-Based Systems",

issn = "0950-7051",

publisher = "Elsevier B.V.",

}

TY - JOUR

T1 - Domain adaptation with temporal ensembling to local attention region search for object detection

AU - Shi, Haobin

AU - He, Ziming

AU - Hwang, Kao Shing

PY - 2025/1/30

Y1 - 2025/1/30

AB - Object detection relies heavily on supervised learning, which requires labeled training data. However, manual labeling often cannot keep pace with data collection, and models trained on one dataset may not generalize well to new datasets with different characteristics, which leads to domain shift. Domain adaptation addresses this problem by leveraging labeled data from a source domain and unlabeled data from a target domain to improve performance on the target domain. Existing domain adaptation architectures still leave considerable room for improvement in target-domain detection accuracy, and a global search over feature maps is computationally expensive. These problems make it difficult to apply domain adaptive object detection directly to tasks such as medical imaging. To this end, this article proposes two architectures: Region-based Object Detection with Domain Adaptation and Temporal Ensembling (DATE) and the Local Attention Region Search Algorithm (LARSA). DATE combines domain adaptation and temporal ensembling to enhance feature alignment between domains, while LARSA employs an attention mechanism to search efficiently for regions of interest and to decide when to terminate the search early. Experiments on various datasets demonstrate that the proposed approaches improve object detection performance under domain shift and reduce computational cost. The proposed framework has the potential to further promote the application of object detection in medical imaging.

KW - Attention mechanism

KW - Domain adaptation

KW - Medical imaging

KW - Object detection

KW - Reinforcement learning

KW - Temporal ensembling

UR - http://www.scopus.com/inward/record.url?scp=85211989418&partnerID=8YFLogxK

U2 - 10.1016/j.knosys.2024.112846

DO - 10.1016/j.knosys.2024.112846

M3 - Article

AN - SCOPUS:85211989418

SN - 0950-7051

VL - 309

JO - Knowledge-Based Systems

JF - Knowledge-Based Systems

M1 - 112846

ER -
