Toward High-Quality HDR Deghosting with Conditional Diffusion Models

Qingsen Yan; Tao Hu; Yuan Sun; Hao Tang; Yu Zhu; Wei Dong; Luc Van Gool; Yanning Zhang

doi:10.1109/TCSVT.2023.3326293

School of Computer Science

Research output: Contribution to journal › Article › peer-review

40 Citations (Scopus)

Abstract

High Dynamic Range (HDR) images can be recovered from several Low Dynamic Range (LDR) images by existing Deep Neural Network (DNN) techniques. Despite remarkable progress, DNN-based methods still generate ghosting artifacts when the LDR images contain saturation and large motion, which hinders potential applications in real-world scenarios. To address this challenge, we formulate HDR deghosting as an image generation problem that uses LDR features as the condition of a diffusion model. The model consists of a feature condition generator and a noise predictor. The feature condition generator employs attention and a Domain Feature Alignment (DFA) layer to transform the intermediate features and avoid ghosting artifacts. With the learned features as conditions, the noise predictor uses the stochastic iterative denoising process of diffusion models to generate an HDR image by steering the sampling process. Furthermore, to mitigate the semantic confusion caused by saturation in the LDR images, we design a sliding window noise estimator that samples smooth noise in a patch-based manner. In addition, an image space loss is proposed to avoid color distortion in the estimated HDR results. We empirically evaluate our model on benchmark datasets for HDR imaging. The results demonstrate that our approach achieves state-of-the-art performance and generalizes well to real-world images.
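The abstract names a sliding window noise estimator that works patch by patch but gives no details. The sketch below is only an illustration of the general idea under stated assumptions: noise is predicted on overlapping windows and the overlapping predictions are averaged so that patch borders stay smooth. The function name `sliding_window_noise`, the `predict_patch` callback (a stand-in for a trained noise predictor), and the plain averaging rule are all hypothetical and are not taken from the paper.

```python
from typing import Callable, List

Grid = List[List[float]]


def sliding_window_noise(
    image: Grid,
    predict_patch: Callable[[Grid], Grid],
    patch: int,
    stride: int,
) -> Grid:
    """Average per-patch noise estimates over overlapping windows.

    Assumption: predict_patch returns a grid with the same shape as its input.
    """
    if stride < 1 or stride > patch:
        # A stride larger than the patch would leave uncovered pixels.
        raise ValueError("stride must satisfy 1 <= stride <= patch")

    h, w = len(image), len(image[0])
    total = [[0.0] * w for _ in range(h)]
    count = [[0] * w for _ in range(h)]

    def starts(size: int) -> List[int]:
        # Window origins; the last window is clamped so the border is covered.
        last = max(size - patch, 0)
        origins = list(range(0, last + 1, stride))
        if origins[-1] != last:
            origins.append(last)
        return origins

    for top in starts(h):
        for left in starts(w):
            window = [row[left:left + patch] for row in image[top:top + patch]]
            estimate = predict_patch(window)
            # Accumulate the estimate and remember how many windows hit each pixel.
            for i, est_row in enumerate(estimate):
                for j, value in enumerate(est_row):
                    total[top + i][left + j] += value
                    count[top + i][left + j] += 1

    return [[total[i][j] / count[i][j] for j in range(w)] for i in range(h)]
```

In a diffusion sampler, a routine of this kind would be called once per denoising step in place of a single full-image prediction. Uniform averaging is the simplest blending choice; a weighted window could be substituted without changing the overall structure.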

Original language	English
Pages (from-to)	4011-4026
Number of pages	16
Journal	IEEE Transactions on Circuits and Systems for Video Technology
Volume	34
Issue number	5
DOI	https://doi.org/10.1109/TCSVT.2023.3326293
Publication status	Published - 1 May 2024

Cite this

@article{c51156791ce64e80971c7ff15d0f2443,

title = "Toward High-Quality HDR Deghosting with Conditional Diffusion Models",

abstract = "High Dynamic Range (HDR) images can be recovered from several Low Dynamic Range (LDR) images by existing Deep Neural Networks (DNNs) techniques. Despite the remarkable progress, DNN-based methods still generate ghosting artifacts when LDR images have saturation and large motion, which hinders potential applications in real-world scenarios. To address this challenge, we formulate the HDR deghosting problem as an image generation that leverages LDR features as the diffusion model's condition, consisting of the feature condition generator and the noise predictor. Feature condition generator employs attention and Domain Feature Alignment (DFA) layer to transform the intermediate features to avoid ghosting artifacts. With the learned features as conditions, the noise predictor leverages a stochastic iterative denoising process for diffusion models to generate an HDR image by steering the sampling process. Furthermore, to mitigate semantic confusion caused by the saturation problem of LDR images, we design a sliding window noise estimator to sample smooth noise in a patch-based manner. In addition, an image space loss is proposed to avoid the color distortion of the estimated HDR results. We empirically evaluate our model on benchmark datasets for HDR imaging. The results demonstrate that our approach achieves state-of-the-art performances and well generalization to real-world images.",

keywords = "High dynamic range image, diffusion model, ghosting artifacts, multi-exposed imaging",

author = "Qingsen Yan and Tao Hu and Yuan Sun and Hao Tang and Yu Zhu and Wei Dong and {Van Gool}, Luc and Yanning Zhang",

note = "Publisher Copyright: {\textcopyright} 1991-2012 IEEE.",

year = "2024",

month = may,

day = "1",

doi = "10.1109/TCSVT.2023.3326293",

language = "English",

volume = "34",

pages = "4011--4026",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "5",

}

TY - JOUR

T1 - Toward High-Quality HDR Deghosting with Conditional Diffusion Models

AU - Yan, Qingsen

AU - Hu, Tao

AU - Sun, Yuan

AU - Tang, Hao

AU - Zhu, Yu

AU - Dong, Wei

AU - Van Gool, Luc

AU - Zhang, Yanning

PY - 2024/5/1

Y1 - 2024/5/1

N2 - High Dynamic Range (HDR) images can be recovered from several Low Dynamic Range (LDR) images by existing Deep Neural Networks (DNNs) techniques. Despite the remarkable progress, DNN-based methods still generate ghosting artifacts when LDR images have saturation and large motion, which hinders potential applications in real-world scenarios. To address this challenge, we formulate the HDR deghosting problem as an image generation that leverages LDR features as the diffusion model's condition, consisting of the feature condition generator and the noise predictor. Feature condition generator employs attention and Domain Feature Alignment (DFA) layer to transform the intermediate features to avoid ghosting artifacts. With the learned features as conditions, the noise predictor leverages a stochastic iterative denoising process for diffusion models to generate an HDR image by steering the sampling process. Furthermore, to mitigate semantic confusion caused by the saturation problem of LDR images, we design a sliding window noise estimator to sample smooth noise in a patch-based manner. In addition, an image space loss is proposed to avoid the color distortion of the estimated HDR results. We empirically evaluate our model on benchmark datasets for HDR imaging. The results demonstrate that our approach achieves state-of-the-art performances and well generalization to real-world images.

AB - High Dynamic Range (HDR) images can be recovered from several Low Dynamic Range (LDR) images by existing Deep Neural Networks (DNNs) techniques. Despite the remarkable progress, DNN-based methods still generate ghosting artifacts when LDR images have saturation and large motion, which hinders potential applications in real-world scenarios. To address this challenge, we formulate the HDR deghosting problem as an image generation that leverages LDR features as the diffusion model's condition, consisting of the feature condition generator and the noise predictor. Feature condition generator employs attention and Domain Feature Alignment (DFA) layer to transform the intermediate features to avoid ghosting artifacts. With the learned features as conditions, the noise predictor leverages a stochastic iterative denoising process for diffusion models to generate an HDR image by steering the sampling process. Furthermore, to mitigate semantic confusion caused by the saturation problem of LDR images, we design a sliding window noise estimator to sample smooth noise in a patch-based manner. In addition, an image space loss is proposed to avoid the color distortion of the estimated HDR results. We empirically evaluate our model on benchmark datasets for HDR imaging. The results demonstrate that our approach achieves state-of-the-art performances and well generalization to real-world images.

KW - High dynamic range image

KW - diffusion model

KW - ghosting artifacts

KW - multi-exposed imaging

UR - http://www.scopus.com/inward/record.url?scp=85174842190&partnerID=8YFLogxK

U2 - 10.1109/TCSVT.2023.3326293

DO - 10.1109/TCSVT.2023.3326293

M3 - Article

AN - SCOPUS:85174842190

SN - 1051-8215

VL - 34

SP - 4011

EP - 4026

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

IS - 5

ER -
