Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization

Zhao Rui Guo, Jia Wei Niu, Zhun Ga Liu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In target recognition, the information about the target usually exists in several domains captured by different sources (sensors). However, it is difficult for us to obtain the perfect target information as the source domain data due to the sensors' limitations sometimes. For the target classification of visible and infrared paired images, we assume that some classes of visible and infrared paired images and other classes of visible images can be obtained, whereas other classes of unseen infrared images need to be classified. This problem is actually a zero-shot deep domain adaptation (ZDDA) problem which divides the data into task-relevant (T-R) data and task-irrelevant (T-I) data. Moreover, the classes of T-R data require recognition, while the classes of T-I data do not need. The traditional ZDDA method sacrifices the classification accuracy of T-R data in the target domain for the generalization ability of T-I data in the source domain. So we propose a method to solve the problem in another way. More precisely, we first use the image-to-image translation network to learn the mapping between the source domain (visible images) T-I data and the target domain (infrared images) T-I data, and convert the visible T-R images to pseudo-infrared images. Then the pseudo-infrared images and the inverted grayscale T-R images are combined to construct a new hybrid domain (source domain I). Meanwhile, we also construct a hybrid domain (source domain II) of T-I images similarly. Besides, we use the infrared T-I images to construct the third domain (source domain III). Finally, we design a deep domain generalization method for cross-domain infrared image classification. And the total loss consists of the classification loss of the source domain I and the distribution alignment loss between the source domains II and III. We evaluate our method using VAIS ship and RGB-NIR scene datasets. The experimental results demonstrate the effectiveness of the proposed method.

Original languageEnglish
Title of host publication2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages487-493
Number of pages7
ISBN (Electronic)9781665476874
DOIs
StatePublished - 2022
Event17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022 - Singapore, Singapore
Duration: 11 Dec 202213 Dec 2022

Publication series

Name2022 17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022

Conference

Conference17th International Conference on Control, Automation, Robotics and Vision, ICARCV 2022
Country/TerritorySingapore
CitySingapore
Period11/12/2213/12/22

Fingerprint

Dive into the research topics of 'Cross-Domain Infrared Image Classification via Image-to-Image Translation and Deep Domain Generalization'. Together they form a unique fingerprint.

Cite this