计算机科学
稳健性(进化)
人工智能
合成孔径雷达
计算机视觉
特征提取
迭代法
模式识别(心理学)
算法
合成数据
数据建模
翻译(生物学)
实体造型
特征(语言学)
迭代最近点
RGB颜色模型
迭代重建
几何本原
杂乱
图像配准
模块化设计
噪音(视频)
GSM演进的增强数据速率
自动目标识别
光流
增采样
水准点(测量)
雷达成像
匹配(统计)
图像处理
作者
Bingxuan Zhao,Chuang Yang,Qing Zhou,Qi Wang
标识
DOI:10.1109/tgrs.2025.3613938
摘要
SAR-to-RGB translation, which transforms Synthetic Aperture Radar (SAR) images into visually interpretable RGB counterparts, is critical for enhancing applications in visual analysis, deep learning, and multi-source data fusion. However, existing methods often fail to preserve both global structural integrity and fine-grained local textures. This deficiency stems from weak feature extraction and the lack of a robust layout framework, leading to outputs with information loss, geometric distortions, and unnatural textures. To overcome these limitations, we propose the Robust Layout-based Iterative Diffusion Model (RLI-DM), a novel three-stage framework for high-fidelity translation. The framework begins with an Optical Reconstruction Module that employs a conditional diffusion model to ensure precise spectral mapping. At its core, the Geometric Robustness Module leverages a Brownian bridge model that we train to derive a noise-resilient layout, overcoming the limitations of conventional edge detection and significantly enhancing global structural fidelity. Finally, this robust layout guides a Customized Multi-Level Refinement Module to iteratively reconstruct local textures, ensuring structural clarity and cross-feature consistency. Extensive experiments on multiple benchmark datasets demonstrate that RLI-DM achieves state-of-the-art performance, significantly outperforming existing methods in both structural integrity and perceptual quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI