修补
计算机科学
人工智能
图像(数学)
傅里叶变换
领域(数学)
编码(集合论)
航程(航空)
计算机视觉
深度学习
快速傅里叶变换
算法
数学
集合(抽象数据类型)
数学分析
复合材料
材料科学
程序设计语言
纯数学
作者
Roman Suvorov,Elizaveta Logacheva,Anton Mashikhin,Anastasia Remizova,Arsenii Ashukha,Aleksei Silvestrov,Naejin Kong,Harshith Goka,Kiwoong Park,Victor Lempitsky
标识
DOI:10.1109/wacv51458.2022.00323
摘要
Modern image inpainting systems, despite the significant progress, often struggle with large missing areas, complex geometric structures, and high-resolution images. We find that one of the main reasons for that is the lack of an effective receptive field in both the inpainting network and the loss function. To alleviate this issue, we propose a new method called large mask inpainting (LaMa). LaMa is based on i) a new inpainting network architecture that uses fast Fourier convolutions (FFCs), which have the image-wide receptive field; ii) a high receptive field perceptual loss; iii) large training masks, which unlocks the potential of the first two components. Our inpainting network improves the state-of-the-art across a range of datasets and achieves excellent performance even in challenging scenarios, e.g. completion of periodic structures. Our model generalizes surprisingly well to resolutions that are higher than those seen at train time, and achieves this at lower parameter&time costs than the competitive baselines. The code is available at https://github.com/saic-mdal/lama.
科研通智能强力驱动
Strongly Powered by AbleSci AI