计算机科学
纵向
解析
动画
面子(社会学概念)
人工智能
计算机图形学(图像)
计算机视觉
人机交互
艺术
视觉艺术
语言学
哲学
作者
Xuze Tian,Jinshan Zhang,Tao Jiang,Boxi Wu,Meng Xi,Zejian Li,Jianwei Yin
标识
DOI:10.1145/3731715.3733464
摘要
Portrait animation aims to transfer the facial expressions and movements of a target character onto a reference character. This task presents two main challenges: accurately transferring motion and expressions while fully preserving the identity features of the reference portrait. We introduce Vividportraits, a diffusion-based model designed to effectively meet these objectives. In contrast to existing methods that rely on sparse representations such as facial landmarks, our approach leverages facial parsing maps for motion guidance, enabling a more precise conveyance of subtle expressions. A random scaling technique is applied during training to prevent the model from internalizing identity-specific features from the driving images. Furthermore, we perform foreground-background segmentation on the reference portrait to reduce data redundancy. The long-video generation process is refined to improve consistency across sequences. Our model, exclusively trained on public datasets, demonstrates superior performance relative to current state-of-the-art methods, achieving a notable 8% improvement in expression metric. More visual results are available on the anonymous website https://www.vividportraits.cn.
科研通智能强力驱动
Strongly Powered by AbleSci AI