TY - JOUR
T1 - RLI-DM
T2 - Robust Layout-Based Iterative Diffusion Model for SAR-to-RGB Image Translation
AU - Zhao, Bingxuan
AU - Yang, Chuang
AU - Zhou, Qing
AU - Wang, Qi
N1 - Publisher Copyright:
© 1980-2012 IEEE.
PY - 2025
Y1 - 2025
N2 - Synthetic aperture radar (SAR)-to-RGB translation, which transforms SAR images into visually interpretable RGB counterparts, is critical for enhancing applications in visual analysis, deep learning, and multisource data fusion. However, existing methods often fail to preserve both global structural integrity and fine-grained local textures. This deficiency stems from weak feature extraction and the lack of a robust layout framework, leading to outputs with information loss, geometric distortions, and unnatural textures. To overcome these limitations, we propose the robust layout-based iterative diffusion model (RLI-DM), a novel three-stage framework for high-fidelity translation. The framework begins with an optical reconstruction module that employs a conditional diffusion model (DM) to ensure precise spectral mapping. At its core, the geometric robustness module (GRM) leverages a Brownian bridge model that we train to derive a noise-resilient layout, overcoming the limitations of conventional edge detection and significantly enhancing global structural fidelity. Finally, this robust layout guides a customized multilevel refinement module (CMRM) to iteratively reconstruct local textures, ensuring structural clarity and cross-feature consistency. Extensive experiments on multiple benchmark datasets demonstrate that RLI-DM achieves state-of-the-art performance, significantly outperforming existing methods in both structural integrity and perceptual quality.
AB - Synthetic aperture radar (SAR)-to-RGB translation, which transforms SAR images into visually interpretable RGB counterparts, is critical for enhancing applications in visual analysis, deep learning, and multisource data fusion. However, existing methods often fail to preserve both global structural integrity and fine-grained local textures. This deficiency stems from weak feature extraction and the lack of a robust layout framework, leading to outputs with information loss, geometric distortions, and unnatural textures. To overcome these limitations, we propose the robust layout-based iterative diffusion model (RLI-DM), a novel three-stage framework for high-fidelity translation. The framework begins with an optical reconstruction module that employs a conditional diffusion model (DM) to ensure precise spectral mapping. At its core, the geometric robustness module (GRM) leverages a Brownian bridge model that we train to derive a noise-resilient layout, overcoming the limitations of conventional edge detection and significantly enhancing global structural fidelity. Finally, this robust layout guides a customized multilevel refinement module (CMRM) to iteratively reconstruct local textures, ensuring structural clarity and cross-feature consistency. Extensive experiments on multiple benchmark datasets demonstrate that RLI-DM achieves state-of-the-art performance, significantly outperforming existing methods in both structural integrity and perceptual quality.
KW - Diffusion model (DM)
KW - iterative refinement
KW - remote sensing
KW - robust layout
KW - synthetic aperture radar (SAR)-to-RGB
UR - https://www.scopus.com/pages/publications/105017436788
U2 - 10.1109/TGRS.2025.3613938
DO - 10.1109/TGRS.2025.3613938
M3 - 文章
AN - SCOPUS:105017436788
SN - 0196-2892
VL - 63
JO - IEEE Transactions on Geoscience and Remote Sensing
JF - IEEE Transactions on Geoscience and Remote Sensing
M1 - 5108009
ER -