计算机科学
人工智能
计算机视觉
扩散
模式识别(心理学)
热力学
物理
作者
Chunming He,Yuqi Shen,Chi-Chun Fang,Fengyang Xiao,Longxiang Tang,Yulun Zhang,Wangmeng Zuo,Zhenhua Guo,Xiu Li
标识
DOI:10.1109/tpami.2025.3545047
摘要
Deep generative models have gained considerable attention in low-level vision tasks due to their powerful generative capabilities. Among these, diffusion model-based approaches, which employ a forward diffusion process to degrade an image and a reverse denoising process for image generation, have become particularly prominent for producing high-quality, diverse samples with intricate texture details. Despite their widespread success in low-level vision, there remains a lack of a comprehensive, insightful survey that synthesizes and organizes the advances in diffusion model-based techniques. To address this gap, this paper presents the first comprehensive review focused on denoising diffusion models applied to low-level vision tasks, covering both theoretical and practical contributions. We outline three general diffusion modeling frameworks and explore their connections with other popular deep generative models, establishing a solid theoretical foundation for subsequent analysis. We then categorize diffusion models used in low-level vision tasks from multiple perspectives, considering both the underlying framework and the target application. Beyond natural image processing, we also summarize diffusion models applied to other low-level vision domains, including medical imaging, remote sensing, and video processing. Additionally, we provide an overview of widely used benchmarks and evaluation metrics in low-level vision tasks. Our review includes an extensive evaluation of diffusion model-based techniques across six representative tasks, with both quantitative and qualitative analysis. Finally, we highlight the limitations of current diffusion models and propose four promising directions for future research. This comprehensive review aims to foster a deeper understanding of the role of denoising diffusion models in low-level vision. For those interested, a curated list of diffusion model-based techniques, datasets, and related information across over 20 low-level vision tasks is available at https://github.com/ChunmingHe/awesome-diffusion-models-in-low-level-vision.
科研通智能强力驱动
Strongly Powered by AbleSci AI