Improving Stereo Matching Generalization via Fourier-Based Amplitude Transform

Xing Li; Yangyu Fan; Zhibo Rao; Zhe Guo; Guoyun Lv

doi:10.1109/LSP.2022.3180306

Improving Stereo Matching Generalization via Fourier-Based Amplitude Transform

Xing Li, Yangyu Fan, Zhibo Rao, Zhe Guo, Guoyun Lv

电子信息学院

Northwestern Polytechnical University Xian

科研成果: 期刊稿件 › 文章 › 同行评审

10 引用（Scopus）

摘要

Stereo matching CNNs suffer from performance deteriorate when evaluated under different distributions from training data. Previous domain adaptation/generalization methods are hard to maintain a robust performance in different baselines and usually require difficult adversarial optimization or intricate network structure. To solve this problem, we propose Fourier-based amplitude transform (FAT), mapping the source image to the target style without altering semantic content, which requires no training to perform the domain alignment. Specifically, we leverage the Fourier transform and its inverse to swap the low-frequency amplitude component of the source data with the target data. To effectively map style and relieve the artifacts, we introduce two factors to control the replacing area: the distance of HSV distribution between source and target images; and the difference between the source left image and its warped left image. Experiments testify FAT can significantly bridge domain gaps, making source data distribution closer to target data. Furthermore, when only training on synthetic datasets, FAT can also help different baselines achieve competitive cross-domain generalization capabilities on real datasets.

源语言	英语
页（从-至）	1362-1366
页数	5
期刊	IEEE Signal Processing Letters
卷	29
DOI	https://doi.org/10.1109/LSP.2022.3180306
出版状态	已出版 - 2022

访问文件

10.1109/LSP.2022.3180306

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{a365e27f970540ecb0d076982ac7680d,

title = "Improving Stereo Matching Generalization via Fourier-Based Amplitude Transform",

abstract = "Stereo matching CNNs suffer from performance deteriorate when evaluated under different distributions from training data. Previous domain adaptation/generalization methods are hard to maintain a robust performance in different baselines and usually require difficult adversarial optimization or intricate network structure. To solve this problem, we propose Fourier-based amplitude transform (FAT), mapping the source image to the target style without altering semantic content, which requires no training to perform the domain alignment. Specifically, we leverage the Fourier transform and its inverse to swap the low-frequency amplitude component of the source data with the target data. To effectively map style and relieve the artifacts, we introduce two factors to control the replacing area: the distance of HSV distribution between source and target images; and the difference between the source left image and its warped left image. Experiments testify FAT can significantly bridge domain gaps, making source data distribution closer to target data. Furthermore, when only training on synthetic datasets, FAT can also help different baselines achieve competitive cross-domain generalization capabilities on real datasets.",

keywords = "cross-domain generalization capability, Fourier-based amplitude transform, Stereo matching",

author = "Xing Li and Yangyu Fan and Zhibo Rao and Zhe Guo and Guoyun Lv",

note = "Publisher Copyright: {\textcopyright} 1994-2012 IEEE.",

year = "2022",

doi = "10.1109/LSP.2022.3180306",

language = "英语",

volume = "29",

pages = "1362--1366",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - Improving Stereo Matching Generalization via Fourier-Based Amplitude Transform

AU - Li, Xing

AU - Fan, Yangyu

AU - Rao, Zhibo

AU - Guo, Zhe

AU - Lv, Guoyun

PY - 2022

Y1 - 2022

N2 - Stereo matching CNNs suffer from performance deteriorate when evaluated under different distributions from training data. Previous domain adaptation/generalization methods are hard to maintain a robust performance in different baselines and usually require difficult adversarial optimization or intricate network structure. To solve this problem, we propose Fourier-based amplitude transform (FAT), mapping the source image to the target style without altering semantic content, which requires no training to perform the domain alignment. Specifically, we leverage the Fourier transform and its inverse to swap the low-frequency amplitude component of the source data with the target data. To effectively map style and relieve the artifacts, we introduce two factors to control the replacing area: the distance of HSV distribution between source and target images; and the difference between the source left image and its warped left image. Experiments testify FAT can significantly bridge domain gaps, making source data distribution closer to target data. Furthermore, when only training on synthetic datasets, FAT can also help different baselines achieve competitive cross-domain generalization capabilities on real datasets.

AB - Stereo matching CNNs suffer from performance deteriorate when evaluated under different distributions from training data. Previous domain adaptation/generalization methods are hard to maintain a robust performance in different baselines and usually require difficult adversarial optimization or intricate network structure. To solve this problem, we propose Fourier-based amplitude transform (FAT), mapping the source image to the target style without altering semantic content, which requires no training to perform the domain alignment. Specifically, we leverage the Fourier transform and its inverse to swap the low-frequency amplitude component of the source data with the target data. To effectively map style and relieve the artifacts, we introduce two factors to control the replacing area: the distance of HSV distribution between source and target images; and the difference between the source left image and its warped left image. Experiments testify FAT can significantly bridge domain gaps, making source data distribution closer to target data. Furthermore, when only training on synthetic datasets, FAT can also help different baselines achieve competitive cross-domain generalization capabilities on real datasets.

KW - cross-domain generalization capability

KW - Fourier-based amplitude transform

KW - Stereo matching

UR - http://www.scopus.com/inward/record.url?scp=85131716085&partnerID=8YFLogxK

U2 - 10.1109/LSP.2022.3180306

DO - 10.1109/LSP.2022.3180306

M3 - 文章

AN - SCOPUS:85131716085

SN - 1070-9908

VL - 29

SP - 1362

EP - 1366

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

ER -

Improving Stereo Matching Generalization via Fourier-Based Amplitude Transform

摘要

访问文件

其它文件与链接

指纹

引用此