Computer science
Computer facial animation
Animation
Artificial intelligence
Autoregressive model
Codebook
Facial motion capture
Computer vision
Speech recognition
Computer animation
Pattern recognition (psychology)
Face detection
Facial recognition system
Computer graphics (images)
Mathematics
Econometrics
Authors
Jinbo Xing, Menghan Xia, Yuechen Zhang, Xiaodong Cun, Jue Wang, Tien-Tsin Wong
Identifier
DOI: 10.1109/cvpr52729.2023.01229
Abstract
Speech-driven 3D facial animation has been widely studied, yet there is still a gap in achieving realism and vividness due to the highly ill-posed nature of the problem and the scarcity of audio-visual data. Existing works typically formulate the cross-modal mapping as a regression task, which suffers from the regression-to-mean problem and leads to over-smoothed facial motions. In this paper, we propose to cast speech-driven facial animation as a code-query task in a finite proxy space of a learned codebook, which effectively promotes the vividness of the generated motions by reducing the cross-modal mapping uncertainty. The codebook is learned by self-reconstruction over real facial motions and is thus embedded with realistic facial-motion priors. Over the discrete motion space, a temporal autoregressive model is employed to sequentially synthesize facial motions from the input speech signal, which guarantees lip sync as well as plausible facial expressions. We demonstrate that our approach outperforms current state-of-the-art methods both qualitatively and quantitatively. A user study further confirms the superiority of our approach in perceptual quality. Code and a video demo are available at https://doubiiu.github.io/projects/codetalker.
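To make the two-stage idea in the abstract concrete, here is a minimal, illustrative PyTorch sketch, not the authors' CodeTalker implementation: a codebook module that quantizes continuous facial-motion features to their nearest learned codes, and a toy autoregressive predictor that queries one code index per frame from speech features while feeding back the previously chosen code. All class names, dimensions, and the GRU-based predictor are assumptions made for illustration.

```python
# Illustrative sketch only (assumed names/dims); not the paper's architecture.
import torch
import torch.nn as nn

class MotionCodebook(nn.Module):
    """Discrete proxy space: maps continuous motion features to nearest codes."""
    def __init__(self, num_codes: int = 256, code_dim: int = 64):
        super().__init__()
        # In the paper this codebook is learned by self-reconstruction
        # over real facial motions; here it is just randomly initialized.
        self.codes = nn.Embedding(num_codes, code_dim)

    def quantize(self, z: torch.Tensor) -> tuple[torch.Tensor, torch.Tensor]:
        # z: (T, code_dim) continuous per-frame motion features.
        dists = torch.cdist(z, self.codes.weight)   # (T, num_codes) distances
        idx = dists.argmin(dim=-1)                  # nearest-code index per frame
        return self.codes(idx), idx                 # quantized features + indices

class SpeechToCode(nn.Module):
    """Toy autoregressive predictor: next code index from speech + past codes."""
    def __init__(self, num_codes: int = 256, code_dim: int = 64, audio_dim: int = 128):
        super().__init__()
        self.fuse = nn.GRU(audio_dim + code_dim, 128, batch_first=True)
        self.head = nn.Linear(128, num_codes)

    @torch.no_grad()
    def generate(self, audio: torch.Tensor, codebook: MotionCodebook) -> torch.Tensor:
        # audio: (1, T, audio_dim); emit one code index per frame, feeding
        # the previously selected code embedding back into the next step.
        T = audio.shape[1]
        prev = torch.zeros(1, 1, codebook.codes.embedding_dim)
        h, out = None, []
        for t in range(T):
            step = torch.cat([audio[:, t : t + 1], prev], dim=-1)
            y, h = self.fuse(step, h)
            idx = self.head(y).argmax(dim=-1)       # greedy code query
            out.append(idx)
            prev = codebook.codes(idx)              # feed back chosen code
        return torch.cat(out, dim=1)                # (1, T) code indices

codebook = MotionCodebook()
feats = torch.randn(10, 64)                  # continuous per-frame motion features
quantized, idx = codebook.quantize(feats)    # discrete proxy of the motion
indices = SpeechToCode().generate(torch.randn(1, 10, 128), codebook)
print(quantized.shape, idx.shape, indices.shape)
```

The sketch shows why restricting generation to codebook lookups reduces mapping uncertainty: the predictor chooses among a finite set of realistic motion codes rather than regressing arbitrary continuous values, so it cannot average ambiguous targets into over-smoothed motion. The greedy argmax query here is a stand-in for whatever decoding strategy the paper actually uses.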