Lumiere: A Space-Time Diffusion Model for Video Generation

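Computer science; Video compression picture types; Video editing; Video processing; Inpainting; Computer vision; Artificial intelligence; Video production; Video post-processing; Frame rate; Stylized fact; Video tracking; Frame (networking); Temporal resolution; Computer graphics (images); Image (mathematics); Multimedia; Physics; Quantum mechanics; Telecommunications; Economics; Macroeconomics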

Authors

Omer Bar-Tal, Hila Chefer, Omer Tov, Charles Herrmann, Roni Paiss, Shiran Zada, Ariel Ephrat, Junhwa Hur, Yuanzhen Li, Tomer Michaeli, Oliver Wang, Deqing Sun, Tali Dekel, Inbar Mosseri

Source

Journal: Cornell University - arXiv; Date: 2024-01-01; Citations: 14

Links

arxiv.org, datacite.org, doi.org

Identifiers

DOI: 10.48550/arxiv.2401.12945

Abstract

We introduce Lumiere -- a text-to-video diffusion model designed for synthesizing videos that portray realistic, diverse and coherent motion -- a pivotal challenge in video synthesis. To this end, we introduce a Space-Time U-Net architecture that generates the entire temporal duration of the video at once, through a single pass in the model. This is in contrast to existing video models which synthesize distant keyframes followed by temporal super-resolution -- an approach that inherently makes global temporal consistency difficult to achieve. By deploying both spatial and (importantly) temporal down- and up-sampling and leveraging a pre-trained text-to-image diffusion model, our model learns to directly generate a full-frame-rate, low-resolution video by processing it in multiple space-time scales. We demonstrate state-of-the-art text-to-video generation results, and show that our design easily facilitates a wide range of content creation tasks and video editing applications, including image-to-video, video inpainting, and stylized generation.
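
The abstract's idea of processing a video at multiple space-time scales can be illustrated with a toy sketch. This is not the authors' implementation: the abstract describes a learned Space-Time U-Net, whereas the code below uses fixed average pooling and nearest-neighbour upsampling on nested Python lists. The factor of 2, the helper names (`downsample_space_time`, `upsample_space_time`, `shape`), and the tiny 8x4x4 clip are all assumptions chosen for illustration.

```python
from typing import List, Tuple

# A grayscale video indexed as video[time][row][col]. (Illustrative type only.)
Video = List[List[List[float]]]


def shape(video: Video) -> Tuple[int, int, int]:
    """Return (frames, height, width) of a nested-list video."""
    return (len(video), len(video[0]), len(video[0][0]))


def downsample_space_time(video: Video, factor: int = 2) -> Video:
    """Average-pool jointly over time, height and width by `factor`."""
    frames, height, width = shape(video)
    if frames % factor or height % factor or width % factor:
        raise ValueError("every dimension must be divisible by the factor")
    pooled = []
    for t in range(0, frames, factor):
        out_frame = []
        for y in range(0, height, factor):
            out_row = []
            for x in range(0, width, factor):
                # Collect the factor x factor x factor space-time block.
                block = [
                    video[t + dt][y + dy][x + dx]
                    for dt in range(factor)
                    for dy in range(factor)
                    for dx in range(factor)
                ]
                out_row.append(sum(block) / len(block))
            out_frame.append(out_row)
        pooled.append(out_frame)
    return pooled


def upsample_space_time(video: Video, factor: int = 2) -> Video:
    """Nearest-neighbour upsampling in time, height and width by `factor`."""
    frames, height, width = shape(video)
    return [
        [
            [video[t // factor][y // factor][x // factor]
             for x in range(width * factor)]
            for y in range(height * factor)
        ]
        for t in range(frames * factor)
    ]


if __name__ == "__main__":
    # Hypothetical 8-frame, 4x4 clip with constant intensity.
    clip = [[[1.0] * 4 for _ in range(4)] for _ in range(8)]
    coarse = downsample_space_time(clip)      # one space-time scale down
    coarser = downsample_space_time(coarse)   # two scales down
    restored = upsample_space_time(upsample_space_time(coarser))
    print(shape(clip), shape(coarse), shape(coarser), shape(restored))
```

The point of the sketch is only that the temporal axis shrinks and grows together with the spatial axes, so every frame of the clip is represented at every scale, in contrast to a pipeline that generates sparse keyframes first and fills in the frames between them afterwards.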
