发布文献求助

Learning From Text: A Multimodal Face Inpainting Network for Irregular Holes

修补计算机科学人工智能面子（社会学概念）计算机视觉自然语言处理模式识别（心理学）图像（数学）语言学哲学

作者

Dandan Zhan,Jiahao Wu,Xing Luo,Zhi Jin

出处

期刊：IEEE Transactions on Circuits and Systems for Video Technology [Institute of Electrical and Electronics Engineers]
日期：2024-02-27 卷期号：34 (8): 7484-7497 被引量：1

标识

DOI：10.1109/tcsvt.2024.3370578

摘要

Irregular hole face inpainting is a challenging task, since the appearance of faces varies greatly (e.g., different expressions and poses) and the human vision is more sensitive to subtle blemishes in the inpainted face images. Without external information, most existing methods struggle to generate new content containing semantic information for face components in the absence of sufficient contextual information. As it is known that text can be used to describe the content of an image in most cases, and is flexible and user-friendly. In this work, a concise and effective Multimodal Face Inpainting Network (MuFIN) is proposed, which simultaneously utilizes the information of the known regions and the descriptive text of the input image to address the problem of irregular hole face inpainting. To fully exploit the rest parts of the corrupted face images, a plug-and-play Multi-scale Multi-level Skip Fusion Module (MMSFM), which extracts multi-scale features and fuses shallow features into deep features at multiple levels, is illustrated. Moreover, to bridge the gap between textual and visual modalities and effectively fuse cross-modal features, a Multi-scale Text-Image Fusion Block (MTIFB), which incorporates text features into image features from both local and global scales, is developed. Extensive experiments conducted on two commonly used datasets CelebA and Multi-Modal-CelebA-HQ demonstrate that our method outperforms state-of-the-art methods both qualitatively and quantitatively, and can generate realistic and controllable results.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

新增更精细的自定义提醒设置 (2026-1-4)

更新

2025年影响因子查询已上线 (2025-6-18)

更新

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 机智的寒天发布了新的文献求助10

1秒前; NexusExplorer上传了应助文件

1秒前; 孙文霞完成签到，获得积分10

3秒前; 搜集达人上传了应助文件

3秒前; 星辰大海上传了应助文件

3秒前; 星辰大海上传了应助文件

3秒前; 欣喜豌豆完成签到，获得积分10

4秒前; 快乐科研发布了新的文献求助10

4秒前; 田様的应助被拼搏的败采纳，获得10

4秒前; 承乐上传了应助文件

5秒前; 属下存在感发布了新的文献求助10

5秒前; 杨晓沛完成签到，获得积分10

6秒前; 焱鑫完成签到，获得积分10

6秒前; WXH完成签到，获得积分10

7秒前; 小巧莺发布了新的文献求助10

7秒前; 深情安青上传了应助文件

7秒前; 左手写情发布了新的文献求助30

8秒前; Voyager发布了新的文献求助10

8秒前; 至秦完成签到，获得积分10

10秒前; 考博圣体发布了新的文献求助10

10秒前; 斯文败类上传了应助文件

12秒前; 温暖的沛槐完成签到，获得积分10

12秒前; 李梁关闭了李梁的文献求助

13秒前; 背后的大米发布了新的文献求助10

13秒前; 完美世界上传了应助文件

14秒前; 无极微光上传了应助文件

15秒前; 英俊的铭上传了应助文件

15秒前; 诚心的访蕊完成签到，获得积分10

16秒前; 无情夏槐发布了新的文献求助10

17秒前; 一十六发布了新的文献求助10

18秒前; 遇安发布了新的文献求助10

19秒前; 所所的应助被凸凸采纳，获得10

19秒前; 专注白昼发布了新的文献求助10

19秒前; 领导范儿的应助被koi采纳，获得10

20秒前; Anna上传了应助文件

20秒前; 科研通AI2S上传了应助文件

21秒前; 慈祥的大船完成签到，获得积分10

21秒前; 在水一方的应助被老小孩采纳，获得10

21秒前; hh完成签到，获得积分10

21秒前; 钰天心上传了应助文件

22秒前

高分求助中: (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; 人脑智能与人工智能 1000; 花の香りの秘密―遺伝子情報から機能性まで 800; King Tyrant 720; Silicon in Organic, Organometallic, and Polymer Chemistry 500; Principles of Plasma Discharges and Materials Processing, 3rd Edition 400; El poder y la palabra: prensa y poder político en las dictaduras : el régimen de Franco ante la prensa y el periodismo 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 5605558; 求助须知：如何正确求助？哪些是违规求助？ 4690129; 关于积分的说明 14862351; 捐赠科研通 4701941; 什么是DOI，文献DOI怎么找？ 2542175; 邀请新用户注册赠送积分活动 1507804; 关于科研通互助平台的介绍 1472113

今日热心研友

殷勤的紫槐

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通