SwinIT: Hierarchical Image-to-Image Translation Framework without Cycle Consistency

计算机科学 图像翻译 人工智能 翻译(生物学) 规范化(社会学) 一致性(知识库) 模式识别(心理学) 数据挖掘 计算机视觉 理论计算机科学 图像(数学) 生物化学 化学 社会学 信使核糖核酸 人类学 基因
作者
Jin Liu,Huiyuan Fu,Xin Wang,Huadóng Ma
出处
期刊:IEEE Transactions on Circuits and Systems for Video Technology [Institute of Electrical and Electronics Engineers]
卷期号:: 1-1
标识
DOI:10.1109/tcsvt.2024.3353932
摘要

Image-to-image (I2I) translation often requires establishing cycle consistency between the source and the translated images across different domains. However, cycle consistency requires redundant reconstruction, and is too restrictive to satisfy the bijection assumption between the two domains. In this paper, we propose SwinIT, a hierarchical Swin-transformer I2I Translation framework without using cycle consistency. Specifically, we carefully design symmetrical encoders for content and style flows, then explore newly proposed adaptive denormalization and normalization strategies. This framework can effectively capture and fuse content and style representations in a coarse-to-fine manner, ensuring our method achieves high performance without cycle consistency. Guided by element-wise feature adaptive denormalization, our model focuses on preserving semantic structure information. Due to the semantic mismatch between unpaired source and exemplar images, we introduce cross-attention adaptive instance normalization to help achieve better alignment. However, because the original optimization objective lacks direct supervision to preserve high-frequency information, rich edge details are lost during the translation. We propose a wavelet transformation matching loss to recover the details by converting the image into multi-frequency parts. We validate our proposed method in various I2I translation tasks, including arbitrary style transfer, multi-modal image synthesis, and semantic image synthesis, demonstrating its effectiveness in both qualitative and quantitative evaluations.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
华贞完成签到,获得积分10
1秒前
lovexa完成签到,获得积分10
1秒前
1秒前
1秒前
3秒前
轮回完成签到,获得积分10
3秒前
Jupiter完成签到,获得积分10
3秒前
3秒前
luobeibei应助黎明锦葵采纳,获得10
4秒前
4秒前
Leo完成签到,获得积分10
4秒前
leaf完成签到 ,获得积分10
4秒前
独钓寒江jc完成签到 ,获得积分10
5秒前
科研狗完成签到,获得积分10
5秒前
合适的毛豆完成签到,获得积分10
5秒前
类囊体薄膜完成签到,获得积分10
6秒前
7秒前
Scidog完成签到,获得积分10
7秒前
落寞电灯胆完成签到,获得积分10
7秒前
Jasper应助狠毒的小龙虾采纳,获得10
7秒前
qyang发布了新的文献求助10
7秒前
believe完成签到,获得积分10
9秒前
潇洒完成签到,获得积分20
10秒前
11秒前
等等发布了新的文献求助10
11秒前
852应助科研通管家采纳,获得10
12秒前
小二郎应助科研通管家采纳,获得10
12秒前
香蕉觅云应助科研通管家采纳,获得10
12秒前
Robert9806完成签到,获得积分10
12秒前
NexusExplorer应助科研通管家采纳,获得10
12秒前
在水一方应助科研通管家采纳,获得10
12秒前
搜集达人应助科研通管家采纳,获得10
12秒前
脑洞疼应助科研通管家采纳,获得10
12秒前
YYYYY应助科研通管家采纳,获得10
12秒前
12秒前
Robert9806发布了新的文献求助10
14秒前
一位名圆完成签到,获得积分10
14秒前
不吃橘子完成签到 ,获得积分10
14秒前
个性的紫菜应助cy采纳,获得20
15秒前
冬天的尔安完成签到 ,获得积分10
15秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Gymnastik für die Jugend 600
Chinese-English Translation Lexicon Version 3.0 500
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 440
Plesiosaur extinction cycles; events that mark the beginning, middle and end of the Cretaceous 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2384758
求助须知:如何正确求助?哪些是违规求助? 2091560
关于积分的说明 5259890
捐赠科研通 1818650
什么是DOI,文献DOI怎么找? 907029
版权声明 559114
科研通“疑难数据库(出版商)”最低求助积分说明 484480