图像翻译
计算机科学
翻译(生物学)
图像(数学)
残余物
相似性(几何)
人工智能
领域(数学分析)
频道(广播)
频域
图像质量
计算机视觉
模式识别(心理学)
算法
数学
基因
信使核糖核酸
生物化学
数学分析
计算机网络
化学
作者
Xin Zhang,Mengning Yang,Haiyang Chai
标识
DOI:10.1109/smc52423.2021.9659053
摘要
Among the major remaining challenges in image-to-image translation is the capacity to generate multi-domain High-Definition images. Recently, the StarGAN is proposed to solve the multi-domain image translation problem. However, the StarGAN focuses on the low-resolution facial image synthesis, leaving some high-frequency information and unsuitable for HD image translation tasks. To address the issue, we propose the Enhanced StarGAN (called EStarGAN), aiming to generate multi-style High-Definition images with fine-grained details simultaneously. Specifically, we propose the Channel-and-Spatial Residual-in-Residual (CSRIR) module to learn high-frequency information in the deep network more effectively. Furthermore, we propose a new reconstruction loss, which consists of mean squared error and Structural Similarity loss to enhance the quality of the generated images. Extensive experiments on real-world datasets prove that our EStarGAN excels baselines with respect to both subjective and objective evaluations.
科研通智能强力驱动
Strongly Powered by AbleSci AI