TY - JOUR
T1 - DDFN: Deblurring Dictionary Encoding Fusion Network for Infrared and Visible Image Object Detection
AU - Lai, Jiawei
AU - Geng, Jie
AU - Deng, Xinyang
AU - Jiang, Wen
N1 - Publisher Copyright:
© 2004-2012 IEEE.
PY - 2023
Y1 - 2023
AB - Both infrared and visible images offer advantages for object detection: infrared images (IRs) capture the thermal characteristics of objects, while visible images provide high spatial resolution and clear texture details. Combining infrared and visible images for object detection is therefore attractive, but fully exploiting the inherent characteristics of the two modalities remains a challenging issue. To address this issue, a deblurring dictionary encoding fusion network (DDFN) is proposed for infrared and visible image object detection. First, a dual-stream feature extraction backbone is constructed to learn features suited to the characteristics of each modality. Then, pooling operations are applied to distill key information and reduce the complexity of the network. Afterward, a fuzzy compensation module (FCM) is proposed to minimize the information loss incurred by pooling. Finally, a dictionary encoding fusion module (DEFM) is proposed to robustly excavate potential interactions between infrared and visible images, obtaining fused features by aggregating the local information of infrared features and the long-range dependency information of visible features. The proposed DDFN exhibits excellent performance on two benchmark bimodal datasets and shows superior capability in infrared-visible image object detection.
KW - Dual-stream feature extraction
KW - feature fusion
KW - infrared image (IR)
KW - object detection
UR - http://www.scopus.com/inward/record.url?scp=85170567346&partnerID=8YFLogxK
U2 - 10.1109/LGRS.2023.3311176
DO - 10.1109/LGRS.2023.3311176
M3 - Article
AN - SCOPUS:85170567346
SN - 1545-598X
VL - 20
JO - IEEE Geoscience and Remote Sensing Letters
JF - IEEE Geoscience and Remote Sensing Letters
M1 - 6009705
ER -