Designing an encoder for StyleGAN image manipulation

计算机科学 编码器 发电机(电路理论) 失真(音乐) 人工智能 计算机视觉 图像编辑 图像(数学) 源代码 图像质量 反演(地质) 构造盆地 操作系统 生物 物理 量子力学 古生物学 功率(物理) 放大器 带宽(计算) 计算机网络
作者
Omer Tov,Yuval Alaluf,Yotam Nitzan,Or Patashnik,Daniel Cohen‐Or
出处
期刊:ACM Transactions on Graphics [Association for Computing Machinery]
卷期号:40 (4): 1-14 被引量:468
标识
DOI:10.1145/3450626.3459838
摘要

Recently, there has been a surge of diverse methods for performing image editing by employing pre-trained unconditional generators. Applying these methods on real images, however, remains a challenge, as it necessarily requires the inversion of the images into their latent space. To successfully invert a real image, one needs to find a latent code that reconstructs the input image accurately, and more importantly, allows for its meaningful manipulation. In this paper, we carefully study the latent space of StyleGAN, the state-of-the-art unconditional generator. We identify and analyze the existence of a distortion-editability tradeoff and a distortion-perception tradeoff within the StyleGAN latent space. We then suggest two principles for designing encoders in a manner that allows one to control the proximity of the inversions to regions that StyleGAN was originally trained on. We present an encoder based on our two principles that is specifically designed for facilitating editing on real images by balancing these tradeoffs. By evaluating its performance qualitatively and quantitatively on numerous challenging domains, including cars and horses, we show that our inversion method, followed by common editing techniques, achieves superior real-image editing quality, with only a small reconstruction accuracy drop.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
重要的凝竹完成签到,获得积分10
刚刚
烟花应助ZM采纳,获得10
刚刚
1秒前
小轩窗zst发布了新的文献求助10
1秒前
水1111完成签到,获得积分20
2秒前
5秒前
5秒前
6秒前
小轩窗zst完成签到,获得积分10
7秒前
chenzhuod发布了新的文献求助10
11秒前
orixero应助天天看文献采纳,获得10
11秒前
飞天小女警应助念之采纳,获得20
11秒前
13秒前
16秒前
Ava应助zhang采纳,获得10
16秒前
17秒前
17秒前
CodeCraft应助爱听歌的孤容采纳,获得10
18秒前
范垂钦发布了新的文献求助10
21秒前
董研完成签到,获得积分10
22秒前
繁荣的青旋完成签到 ,获得积分10
23秒前
Liu完成签到,获得积分10
23秒前
Asura完成签到,获得积分10
23秒前
23秒前
斐嘿嘿发布了新的文献求助10
23秒前
26秒前
27秒前
完美世界应助熙熙沅沅采纳,获得10
28秒前
29秒前
111发布了新的文献求助10
30秒前
30秒前
镜中男人发布了新的文献求助20
30秒前
31秒前
灵巧迎夏完成签到,获得积分10
33秒前
南冥完成签到 ,获得积分10
33秒前
33秒前
幺儿小桃酥完成签到,获得积分10
34秒前
科研通AI5应助Dopamine采纳,获得10
34秒前
36秒前
飞天小女警应助Yi采纳,获得10
36秒前
高分求助中
Les Mantodea de Guyane Insecta, Polyneoptera 2500
Mobilization, center-periphery structures and nation-building 600
Technologies supporting mass customization of apparel: A pilot project 450
China—Art—Modernity: A Critical Introduction to Chinese Visual Expression from the Beginning of the Twentieth Century to the Present Day 430
Tip60 complex regulates eggshell formation and oviposition in the white-backed planthopper, providing effective targets for pest control 400
A Field Guide to the Amphibians and Reptiles of Madagascar - Frank Glaw and Miguel Vences - 3rd Edition 400
China Gadabouts: New Frontiers of Humanitarian Nursing, 1941–51 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3792160
求助须知:如何正确求助?哪些是违规求助? 3336436
关于积分的说明 10280990
捐赠科研通 3053122
什么是DOI,文献DOI怎么找? 1675474
邀请新用户注册赠送积分活动 803469
科研通“疑难数据库(出版商)”最低求助积分说明 761414