计算机科学
变压器
鉴别器
人工智能
翻译(生物学)
图像翻译
图像分辨率
计算机视觉
图像(数学)
电压
工程类
电气工程
电信
生物化学
化学
探测器
信使核糖核酸
基因
作者
Y. Li,Yaochen Li,Wenneng Tang,Zijian Zhu,Jianlong Yang,Yuehu Liu
标识
DOI:10.1145/3581783.3612518
摘要
The transformer model has gained a lot of success in various computer vision tasks owing to its capacity of modeling long-range dependencies. However, its application has been limited in the area of high-resolution unpaired image translation using GANs due to the quadratic complexity with the spatial resolution of input features. In this paper, we propose a novel transformer-based GAN for high-resolution unpaired image translation named Swin-UNIT. A two-stage generator is designed which consists of a global style translation (GST) module and a recurrent detail supplement (RDS) module. The GST module focuses on translating low-resolution global features using the ability of self-attention. The RDS module offers quick information propagation from the global features to the detail features at a high resolution using cross-attention. Moreover, we customize a dual-branch discriminator to guide the generator. Extensive experiments demonstrate that our model achieves state-of-the-art results on the unpaired image translation tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI