去模糊
计算机科学
图像复原
变压器
概率逻辑
人工智能
计算机视觉
利用
各项异性扩散
图像分辨率
图像处理
图像(数学)
电压
工程类
计算机安全
电气工程
标识
DOI:10.1007/978-981-99-8552-4_20
摘要
Although diffusion models have achieved impressive success for image generation, its application for image restoration is still underexplored. Following tremendous success in natural language processing, transformers have also shown great success for computer vision. Although several researches indicate that increasing transformer depth/width improves the applicability of diffusion models, application of Transformers in diffusion models is still underexplored due to quadratic complexity with the spatial resolution. In this work, we proposed a Transformer-based Denoising Diffusion Probabilistic Model (TransDDPM) for image restoration. With multi-head cross-covariance attention (MXCA), TransDDPM can operates global self-attention with cross-covariance matrix in channel dimension rather than spatial dimension. Another gated feed-forward network (GFFN) is included to enhance the ability to exploit spatial local context. Powered by these designs, TransDDPM is capable for both long-range dependencies and short-range dependencies and flexible for images of various resolutions. Comprehensive experiments demonstrate our TransDDPM achieves state-of-the art performance on several restoration tasks, e.g., image deraining, image dehazing and motion deblurring.
科研通智能强力驱动
Strongly Powered by AbleSci AI