Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Qingping Zheng; Ling Zheng; Yuanfan Guo; Ying Li; Songcen Xu; Jiankang Deng; Hang Xu

doi:10.1109/CVPR52733.2024.02438

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Qingping Zheng, Ling Zheng, Yuanfan Guo, Ying Li, Songcen Xu, Jiankang Deng, Hang Xu

School of Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

6 Scopus citations

Abstract

Artifact-free super-resolution (SR) aims to translate low-resolution images into their high-resolution counterparts with a strict integrity of the original content, eliminating any distortions or synthetic details. While traditional diffusion-based SR techniques have demonstrated remarkable abilities to enhance image detail, they are prone to ar-tifact introduction during iterative procedures. Such arti-facts, ranging from trivial noise to unauthentic textures, de-viate from the true structure of the source image, thus chal-lenging the integrity of the super-resolution process. In this work, we propose Self-Adaptive Reality-Guided Diffusion (SARGD), a training-free method that delves into the latent space to effectively identify and mitigate the propagation of artifacts. Our SARGD begins by using an artifact detector to identify implausible pixels, creating a binary mask that highlights artifacts. Following this, the Reality Guidance Refinement (RGR) process refines artifacts by integrating this mask with realistic latent representations, improving alignment with the original image. Nonetheless, initial realistic-latent representations from lower-quality images result in over-smoothing in the final output. To address this, we introduce a Self-Adaptive Guidance (SAG) mechanism. It dynamically computes a reality score, enhancing the sharpness of the realistic latent. These alternating mechanisms collectively achieve artifact-free super-resolution. Extensive experiments demonstrate the superiority of our method, delivering detailed artifact-free high-resolution images while reducing sampling steps by 2 x. We release our code at https://github.com/ProAirVerse/Self-Adaptive-Guidance-Diffusion.git.

Original language	English
Title of host publication	Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
Publisher	IEEE Computer Society
Pages	25806-25816
Number of pages	11
ISBN (Electronic)	9798350353006
DOIs	https://doi.org/10.1109/CVPR52733.2024.02438
State	Published - 2024
Event	2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024 - Seattle, United States Duration: 16 Jun 2024 → 22 Jun 2024

Publication series

Name	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)	1063-6919

Conference

Conference	2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024
Country/Territory	United States
City	Seattle
Period	16/06/24 → 22/06/24

Access to Document

10.1109/CVPR52733.2024.02438

Cite this

Zheng, Q., Zheng, L., Guo, Y., Li, Y., Xu, S., Deng, J., & Xu, H. (2024). Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution. In Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024 (pp. 25806-25816). (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). IEEE Computer Society. https://doi.org/10.1109/CVPR52733.2024.02438

@inproceedings{198a32d26e7c4f7ebe0d13e7a7a5a207,

title = "Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution",

abstract = "Artifact-free super-resolution (SR) aims to translate low-resolution images into their high-resolution counterparts with a strict integrity of the original content, eliminating any distortions or synthetic details. While traditional diffusion-based SR techniques have demonstrated remarkable abilities to enhance image detail, they are prone to ar-tifact introduction during iterative procedures. Such arti-facts, ranging from trivial noise to unauthentic textures, de-viate from the true structure of the source image, thus chal-lenging the integrity of the super-resolution process. In this work, we propose Self-Adaptive Reality-Guided Diffusion (SARGD), a training-free method that delves into the latent space to effectively identify and mitigate the propagation of artifacts. Our SARGD begins by using an artifact detector to identify implausible pixels, creating a binary mask that highlights artifacts. Following this, the Reality Guidance Refinement (RGR) process refines artifacts by integrating this mask with realistic latent representations, improving alignment with the original image. Nonetheless, initial realistic-latent representations from lower-quality images result in over-smoothing in the final output. To address this, we introduce a Self-Adaptive Guidance (SAG) mechanism. It dynamically computes a reality score, enhancing the sharpness of the realistic latent. These alternating mechanisms collectively achieve artifact-free super-resolution. Extensive experiments demonstrate the superiority of our method, delivering detailed artifact-free high-resolution images while reducing sampling steps by 2 x. We release our code at https://github.com/ProAirVerse/Self-Adaptive-Guidance-Diffusion.git.",

author = "Qingping Zheng and Ling Zheng and Yuanfan Guo and Ying Li and Songcen Xu and Jiankang Deng and Hang Xu",

note = "Publisher Copyright: {\textcopyright} 2024 IEEE.; 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024 ; Conference date: 16-06-2024 Through 22-06-2024",

year = "2024",

doi = "10.1109/CVPR52733.2024.02438",

language = "英语",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "25806--25816",

booktitle = "Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024",

}

Zheng, Q, Zheng, L, Guo, Y, Li, Y, Xu, S, Deng, J & Xu, H 2024, Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution. in Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, IEEE Computer Society, pp. 25806-25816, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024, Seattle, United States, 16/06/24. https://doi.org/10.1109/CVPR52733.2024.02438

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution. / Zheng, Qingping; Zheng, Ling; Guo, Yuanfan et al.
Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024. IEEE Computer Society, 2024. p. 25806-25816 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

TY - GEN

T1 - Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

AU - Zheng, Qingping

AU - Zheng, Ling

AU - Guo, Yuanfan

AU - Li, Ying

AU - Xu, Songcen

AU - Deng, Jiankang

AU - Xu, Hang

PY - 2024

Y1 - 2024

N2 - Artifact-free super-resolution (SR) aims to translate low-resolution images into their high-resolution counterparts with a strict integrity of the original content, eliminating any distortions or synthetic details. While traditional diffusion-based SR techniques have demonstrated remarkable abilities to enhance image detail, they are prone to ar-tifact introduction during iterative procedures. Such arti-facts, ranging from trivial noise to unauthentic textures, de-viate from the true structure of the source image, thus chal-lenging the integrity of the super-resolution process. In this work, we propose Self-Adaptive Reality-Guided Diffusion (SARGD), a training-free method that delves into the latent space to effectively identify and mitigate the propagation of artifacts. Our SARGD begins by using an artifact detector to identify implausible pixels, creating a binary mask that highlights artifacts. Following this, the Reality Guidance Refinement (RGR) process refines artifacts by integrating this mask with realistic latent representations, improving alignment with the original image. Nonetheless, initial realistic-latent representations from lower-quality images result in over-smoothing in the final output. To address this, we introduce a Self-Adaptive Guidance (SAG) mechanism. It dynamically computes a reality score, enhancing the sharpness of the realistic latent. These alternating mechanisms collectively achieve artifact-free super-resolution. Extensive experiments demonstrate the superiority of our method, delivering detailed artifact-free high-resolution images while reducing sampling steps by 2 x. We release our code at https://github.com/ProAirVerse/Self-Adaptive-Guidance-Diffusion.git.

AB - Artifact-free super-resolution (SR) aims to translate low-resolution images into their high-resolution counterparts with a strict integrity of the original content, eliminating any distortions or synthetic details. While traditional diffusion-based SR techniques have demonstrated remarkable abilities to enhance image detail, they are prone to ar-tifact introduction during iterative procedures. Such arti-facts, ranging from trivial noise to unauthentic textures, de-viate from the true structure of the source image, thus chal-lenging the integrity of the super-resolution process. In this work, we propose Self-Adaptive Reality-Guided Diffusion (SARGD), a training-free method that delves into the latent space to effectively identify and mitigate the propagation of artifacts. Our SARGD begins by using an artifact detector to identify implausible pixels, creating a binary mask that highlights artifacts. Following this, the Reality Guidance Refinement (RGR) process refines artifacts by integrating this mask with realistic latent representations, improving alignment with the original image. Nonetheless, initial realistic-latent representations from lower-quality images result in over-smoothing in the final output. To address this, we introduce a Self-Adaptive Guidance (SAG) mechanism. It dynamically computes a reality score, enhancing the sharpness of the realistic latent. These alternating mechanisms collectively achieve artifact-free super-resolution. Extensive experiments demonstrate the superiority of our method, delivering detailed artifact-free high-resolution images while reducing sampling steps by 2 x. We release our code at https://github.com/ProAirVerse/Self-Adaptive-Guidance-Diffusion.git.

UR - http://www.scopus.com/inward/record.url?scp=85207022406&partnerID=8YFLogxK

U2 - 10.1109/CVPR52733.2024.02438

DO - 10.1109/CVPR52733.2024.02438

M3 - 会议稿件

AN - SCOPUS:85207022406

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 25806

EP - 25816

BT - Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024

PB - IEEE Computer Society

T2 - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024

Y2 - 16 June 2024 through 22 June 2024

ER -

Zheng Q, Zheng L, Guo Y, Li Y, Xu S, Deng J et al. Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution. In Proceedings - 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2024. IEEE Computer Society. 2024. p. 25806-25816. (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition). doi: 10.1109/CVPR52733.2024.02438

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this