Boosting Object Detectors via Strong-Classification Weak-Localization Pretraining in Remote Sensing Imagery

Cong Zhang; Tianshan Liu; Jun Xiao; Kin Man Lam; Qi Wang

doi:10.1109/TIM.2023.3315392

Boosting Object Detectors via Strong-Classification Weak-Localization Pretraining in Remote Sensing Imagery

Cong Zhang, Tianshan Liu, Jun Xiao, Kin Man Lam, Qi Wang

光电与智能研究院

Hong Kong Polytechnic University

科研成果: 期刊稿件 › 文章 › 同行评审

30 引用（Scopus）

摘要

Deep learning-based object detectors in remote sensing (RS) scenarios typically follow the paradigm of pretraining and fine-tuning to alleviate the limitation of insufficient downstream data. Despite the improved performance, existing pretraining paradigms are suboptimal due to three deficiencies: 1) inconsistent domains, i.e., pretraining on natural scenes and fine-tuning for RS scenes; 2) mismatched task objectives, i.e., classification-oriented pretraining while detection-oriented fine-tuning; and 3) misaligned architectures, i.e., pretraining only one bare backbone yet neglecting other vital detection components. Against these issues, this article proposes a novel pretraining paradigm specifically for the task of RS object detection, namely, RS strong-classification weak-localization (SCWL) pretraining. Unlike conventional classification pretraining, such as the widely used ImageNet pretraining, our pretraining strategy can adaptively perform bounding box generation on a reconstructed large-scale RS classification-style dataset. These pseudobounding boxes are integrated with the original accurate class labels as location- and category-related supervisions, respectively, to pretrain the entire RS detectors. The proposed RS SCWL pretraining paradigm is able to significantly improve downstream detection performance and outperforms classification pretraining methods, including ImageNet pretraining. Extensive experiments on different object detection datasets demonstrate its effectiveness and superiority in boosting various RS detectors.

源语言	英语
文章编号	5026520
期刊	IEEE Transactions on Instrumentation and Measurement
卷	72
DOI	https://doi.org/10.1109/TIM.2023.3315392
出版状态	已出版 - 2023

访问文件

10.1109/TIM.2023.3315392

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{f30528f5b3d141b9a0d54f440865b73f,

title = "Boosting Object Detectors via Strong-Classification Weak-Localization Pretraining in Remote Sensing Imagery",

abstract = "Deep learning-based object detectors in remote sensing (RS) scenarios typically follow the paradigm of pretraining and fine-tuning to alleviate the limitation of insufficient downstream data. Despite the improved performance, existing pretraining paradigms are suboptimal due to three deficiencies: 1) inconsistent domains, i.e., pretraining on natural scenes and fine-tuning for RS scenes; 2) mismatched task objectives, i.e., classification-oriented pretraining while detection-oriented fine-tuning; and 3) misaligned architectures, i.e., pretraining only one bare backbone yet neglecting other vital detection components. Against these issues, this article proposes a novel pretraining paradigm specifically for the task of RS object detection, namely, RS strong-classification weak-localization (SCWL) pretraining. Unlike conventional classification pretraining, such as the widely used ImageNet pretraining, our pretraining strategy can adaptively perform bounding box generation on a reconstructed large-scale RS classification-style dataset. These pseudobounding boxes are integrated with the original accurate class labels as location- and category-related supervisions, respectively, to pretrain the entire RS detectors. The proposed RS SCWL pretraining paradigm is able to significantly improve downstream detection performance and outperforms classification pretraining methods, including ImageNet pretraining. Extensive experiments on different object detection datasets demonstrate its effectiveness and superiority in boosting various RS detectors.",

keywords = "Object detection, pretraining paradigms, remote sensing (RS) imagery, scene classification, weakly supervised object localization (WSOL)",

author = "Cong Zhang and Tianshan Liu and Jun Xiao and Lam, {Kin Man} and Qi Wang",

note = "Publisher Copyright: {\textcopyright} 1963-2012 IEEE.",

year = "2023",

doi = "10.1109/TIM.2023.3315392",

language = "英语",

volume = "72",

journal = "IEEE Transactions on Instrumentation and Measurement",

issn = "0018-9456",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Boosting Object Detectors via Strong-Classification Weak-Localization Pretraining in Remote Sensing Imagery

AU - Zhang, Cong

AU - Liu, Tianshan

AU - Xiao, Jun

AU - Lam, Kin Man

AU - Wang, Qi

PY - 2023

Y1 - 2023

N2 - Deep learning-based object detectors in remote sensing (RS) scenarios typically follow the paradigm of pretraining and fine-tuning to alleviate the limitation of insufficient downstream data. Despite the improved performance, existing pretraining paradigms are suboptimal due to three deficiencies: 1) inconsistent domains, i.e., pretraining on natural scenes and fine-tuning for RS scenes; 2) mismatched task objectives, i.e., classification-oriented pretraining while detection-oriented fine-tuning; and 3) misaligned architectures, i.e., pretraining only one bare backbone yet neglecting other vital detection components. Against these issues, this article proposes a novel pretraining paradigm specifically for the task of RS object detection, namely, RS strong-classification weak-localization (SCWL) pretraining. Unlike conventional classification pretraining, such as the widely used ImageNet pretraining, our pretraining strategy can adaptively perform bounding box generation on a reconstructed large-scale RS classification-style dataset. These pseudobounding boxes are integrated with the original accurate class labels as location- and category-related supervisions, respectively, to pretrain the entire RS detectors. The proposed RS SCWL pretraining paradigm is able to significantly improve downstream detection performance and outperforms classification pretraining methods, including ImageNet pretraining. Extensive experiments on different object detection datasets demonstrate its effectiveness and superiority in boosting various RS detectors.

AB - Deep learning-based object detectors in remote sensing (RS) scenarios typically follow the paradigm of pretraining and fine-tuning to alleviate the limitation of insufficient downstream data. Despite the improved performance, existing pretraining paradigms are suboptimal due to three deficiencies: 1) inconsistent domains, i.e., pretraining on natural scenes and fine-tuning for RS scenes; 2) mismatched task objectives, i.e., classification-oriented pretraining while detection-oriented fine-tuning; and 3) misaligned architectures, i.e., pretraining only one bare backbone yet neglecting other vital detection components. Against these issues, this article proposes a novel pretraining paradigm specifically for the task of RS object detection, namely, RS strong-classification weak-localization (SCWL) pretraining. Unlike conventional classification pretraining, such as the widely used ImageNet pretraining, our pretraining strategy can adaptively perform bounding box generation on a reconstructed large-scale RS classification-style dataset. These pseudobounding boxes are integrated with the original accurate class labels as location- and category-related supervisions, respectively, to pretrain the entire RS detectors. The proposed RS SCWL pretraining paradigm is able to significantly improve downstream detection performance and outperforms classification pretraining methods, including ImageNet pretraining. Extensive experiments on different object detection datasets demonstrate its effectiveness and superiority in boosting various RS detectors.

KW - Object detection

KW - pretraining paradigms

KW - remote sensing (RS) imagery

KW - scene classification

KW - weakly supervised object localization (WSOL)

UR - http://www.scopus.com/inward/record.url?scp=85173002982&partnerID=8YFLogxK

U2 - 10.1109/TIM.2023.3315392

DO - 10.1109/TIM.2023.3315392

M3 - 文章

AN - SCOPUS:85173002982

SN - 0018-9456

VL - 72

JO - IEEE Transactions on Instrumentation and Measurement

JF - IEEE Transactions on Instrumentation and Measurement

M1 - 5026520

ER -

Boosting Object Detectors via Strong-Classification Weak-Localization Pretraining in Remote Sensing Imagery

摘要

访问文件

其它文件与链接

指纹

引用此