Constraining multimodal distribution for domain adaptation in stereo matching

Zhelun Shen, Zhuo Li, Chenming Wu, Zhibo Rao, Lina Liu, Yuchao Dai, Liangjun Zhang

科研成果: 期刊稿件文章同行评审

摘要

Recently, learning-based stereo matching methods have achieved great improvement in public benchmarks, where soft argmin and smooth L1 loss play core contributions to its success. However, in unsupervised domain adaptation scenarios, we observe that these two operations often yield multimodal disparity probability distributions in target domains, resulting in degraded generalization. In this paper, we propose a novel approach, Constrain Multi-modal Distribution (CMD), to address this issue. Specifically, we introduce uncertainty-regularized minimization and anisotropic soft argmin to encourage the network to produce predominantly unimodal disparity distributions in the target domain, thereby improving prediction accuracy. Experimentally, we apply the proposed method to multiple representative stereo-matching networks and conduct domain adaptation from synthetic data to unlabeled real-world scenes. Results consistently demonstrate improved generalization in both top-performing and domain-adaptable stereo-matching models. The code for CMD will be available at: https://github.com/gallenszl/CMD.

源语言英语
文章编号111727
期刊Pattern Recognition
167
DOI
出版状态已出版 - 11月 2025

指纹

探究 'Constraining multimodal distribution for domain adaptation in stereo matching' 的科研主题。它们共同构成独一无二的指纹。

引用此