Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization

Zhao Rui Guo; Jia Wei Niu; Zhun Ga Liu

doi:10.1109/ICARCV57592.2022.10004308

School of Automation

Northwestern Polytechnical University Xian

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

In target recognition, information about a target often exists in several domains captured by different sources (sensors). However, sensor limitations sometimes make it difficult to obtain ideal target information as source-domain data. For the classification of paired visible and infrared images, we assume that visible-infrared image pairs are available for some classes and that only visible images are available for the remaining classes, whose unseen infrared images need to be classified. This is a zero-shot deep domain adaptation (ZDDA) problem, which divides the data into task-relevant (T-R) data and task-irrelevant (T-I) data: the classes of T-R data must be recognized, while the classes of T-I data need not be. The traditional ZDDA method sacrifices the classification accuracy of T-R data in the target domain for the generalization ability of T-I data in the source domain, so we propose a different approach. First, we use an image-to-image translation network to learn the mapping between the T-I data of the source domain (visible images) and the T-I data of the target domain (infrared images), and use it to convert the visible T-R images to pseudo-infrared images. The pseudo-infrared images and the inverted grayscale T-R images are then combined to construct a new hybrid domain (source domain I). A hybrid domain of T-I images (source domain II) is constructed in the same way, and the infrared T-I images form a third domain (source domain III). Finally, we design a deep domain generalization method for cross-domain infrared image classification, whose total loss consists of the classification loss on source domain I and the distribution alignment loss between source domains II and III. We evaluate our method on the VAIS ship and RGB-NIR scene datasets. The experimental results demonstrate the effectiveness of the proposed method.
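
The abstract names two concrete ingredients: inverted grayscale images and a total loss that sums a classification term on source domain I with a distribution alignment term between source domains II and III. The following is a minimal sketch of those two pieces, not the authors' implementation. The abstract does not specify the grayscale conversion, the alignment measure, or any weighting, so the BT.601 luma weights, the squared distance between feature means (a linear-kernel MMD), the `weight` parameter, and all function names here are assumptions made for illustration.

```python
import math


def inverted_grayscale(rgb_image):
    """Convert rows of (r, g, b) pixels in 0-255 to inverted grayscale values."""
    inverted = []
    for row in rgb_image:
        out_row = []
        for r, g, b in row:
            # Assumed conversion: ITU-R BT.601 luma weights.
            gray = 0.299 * r + 0.587 * g + 0.114 * b
            # Inversion: bright visible pixels become dark, and vice versa.
            out_row.append(255 - int(round(gray)))
        inverted.append(out_row)
    return inverted


def cross_entropy(probs, labels):
    """Mean negative log-likelihood of the true class labels."""
    return -sum(math.log(p[y]) for p, y in zip(probs, labels)) / len(labels)


def mean_feature_gap(feats_a, feats_b):
    """Squared distance between per-dimension feature means.

    Assumed stand-in for the distribution alignment loss.
    """
    dim = len(feats_a[0])
    mean_a = [sum(f[i] for f in feats_a) / len(feats_a) for i in range(dim)]
    mean_b = [sum(f[i] for f in feats_b) / len(feats_b) for i in range(dim)]
    return sum((a - b) ** 2 for a, b in zip(mean_a, mean_b))


def total_loss(probs_domain1, labels_domain1, feats_domain2, feats_domain3,
               weight=1.0):
    """Classification loss on domain I plus alignment loss between II and III.

    `weight` is a hypothetical trade-off parameter, not taken from the paper.
    """
    classification = cross_entropy(probs_domain1, labels_domain1)
    alignment = mean_feature_gap(feats_domain2, feats_domain3)
    return classification + weight * alignment
```

In this sketch the class probabilities and features would come from a shared network applied to each domain; only the way the two terms are combined follows the abstract.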

Original language	English
Title of host publication	2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	487-493
Number of pages	7
ISBN (Electronic)	9781665476874
DOI	https://doi.org/10.1109/ICARCV57592.2022.10004308
Publication status	Published - 2022
Event	17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022 - Singapore, Singapore. Duration: 11 Dec 2022 → 13 Dec 2022

Publication series

Name	2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022

Conference

Conference	17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022
Country/Territory	Singapore
City	Singapore
Period	11/12/22 → 13/12/22

Cite this

Guo, Z. R., Niu, J. W., & Liu, Z. G. (2022). Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization. In 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022 (pp. 487-493). (2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICARCV57592.2022.10004308

Guo, Zhao Rui ; Niu, Jia Wei ; Liu, Zhun Ga. / Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization. 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022. Institute of Electrical and Electronics Engineers Inc., 2022. pp. 487-493 (2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022).

@inproceedings{8c4f30cb653c4138a3551537d0d37c39,

title = "Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization",

author = "Guo, {Zhao Rui} and Niu, {Jia Wei} and Liu, {Zhun Ga}",

note = "Publisher Copyright: {\textcopyright} 2022 IEEE.; 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022 ; Conference date: 11-12-2022 Through 13-12-2022",

year = "2022",

doi = "10.1109/ICARCV57592.2022.10004308",

language = "English",

series = "2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "487--493",

booktitle = "2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022",

}

Guo, ZR, Niu, JW & Liu, ZG 2022, Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization. In 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022. 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022, Institute of Electrical and Electronics Engineers Inc., pp. 487-493, 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022, Singapore, Singapore, 11/12/22. https://doi.org/10.1109/ICARCV57592.2022.10004308

Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization. / Guo, Zhao Rui; Niu, Jia Wei; Liu, Zhun Ga.
2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022. Institute of Electrical and Electronics Engineers Inc., 2022. pp. 487-493 (2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022).

TY - GEN

T1 - Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization

AU - Guo, Zhao Rui

AU - Niu, Jia Wei

AU - Liu, Zhun Ga

PY - 2022

Y1 - 2022

UR - http://www.scopus.com/inward/record.url?scp=85146742620&partnerID=8YFLogxK

U2 - 10.1109/ICARCV57592.2022.10004308

DO - 10.1109/ICARCV57592.2022.10004308

M3 - Conference contribution

AN - SCOPUS:85146742620

T3 - 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022

SP - 487

EP - 493

BT - 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022

Y2 - 11 December 2022 through 13 December 2022

ER -

Guo ZR, Niu JW, Liu ZG. Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization. In 2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022. Institute of Electrical and Electronics Engineers Inc. 2022. pp. 487-493. (2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022). doi: 10.1109/ICARCV57592.2022.10004308
