规范化(社会学)
生成语法
人工智能
计算机科学
翻译(生物学)
图像翻译
模式识别(心理学)
图像(数学)
自然语言处理
生物
社会学
信使核糖核酸
人类学
生物化学
基因
作者
Junho Kim,Minjae Kim,Hyeon-Woo Kang,Kwang-Hee Lee
出处
期刊:Cornell University - arXiv
日期:2019-01-01
被引量:265
标识
DOI:10.48550/arxiv.1907.10830
摘要
We propose a novel method for unsupervised image-to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner. The attention module guides our model to focus on more important regions distinguishing between source and target domains based on the attention map obtained by the auxiliary classifier. Unlike previous attention-based method which cannot handle the geometric changes between domains, our model can translate both images requiring holistic changes and images requiring large shape changes. Moreover, our new AdaLIN (Adaptive Layer-Instance Normalization) function helps our attention-guided model to flexibly control the amount of change in shape and texture by learned parameters depending on datasets. Experimental results show the superiority of the proposed method compared to the existing state-of-the-art models with a fixed network architecture and hyper-parameters. Our code and datasets are available at https://github.com/taki0112/UGATIT or https://github.com/znxlwm/UGATIT-pytorch.
科研通智能强力驱动
Strongly Powered by AbleSci AI