水下
图像(数学)
变压器
图像增强
计算机科学
人工智能
计算机视觉
电气工程
地质学
工程类
电压
海洋学
作者
Xingyang Nie,Su Pan,Xiaoyu Zhai,Shifei Tao,Fengzhong Qu,Biao Wang,Huilin Ge,Guojie Xiao
出处
期刊:Cornell University - arXiv
日期:2024-07-07
被引量:1
标识
DOI:10.48550/arxiv.2407.05389
摘要
Underwater image enhancement (UIE) has attracted much attention owing to its importance for underwater operation and marine engineering. Motivated by the recent advance in generative models, we propose a novel UIE method based on image-conditional diffusion transformer (ICDT). Our method takes the degraded underwater image as the conditional input and converts it into latent space where ICDT is applied. ICDT replaces the conventional U-Net backbone in a denoising diffusion probabilistic model (DDPM) with a transformer, and thus inherits favorable properties such as scalability from transformers. Furthermore, we train ICDT with a hybrid loss function involving variances to achieve better log-likelihoods, which meanwhile significantly accelerates the sampling process. We experimentally assess the scalability of ICDTs and compare with prior works in UIE on the Underwater ImageNet dataset. Besides good scaling properties, our largest model, ICDT-XL/2, outperforms all comparison methods, achieving state-of-the-art (SOTA) quality of image enhancement.
科研通智能强力驱动
Strongly Powered by AbleSci AI