人工智能
计算机科学
先验概率
匹配(统计)
模式识别(心理学)
特征(语言学)
一致性(知识库)
参数统计
图像(数学)
计算机视觉
数学
贝叶斯概率
语言学
统计
哲学
作者
Jérôme Revaud,Vincent Leroy,Philippe Weinzaepfel,Boris Chidlovskii
标识
DOI:10.1109/cvpr52688.2022.00390
摘要
Existing approaches for learning local image descriptors have shown remarkable achievements in a wide range of geometric tasks. However, most of them require perpixel correspondence-level supervision, which is difficult to acquire at scale and in high quality. In this paper, we propose to explicitly integrate two matching priors in a single loss in order to learn local descriptors without supervision. Given two images depicting the same scene, we extract pixel descriptors and build a correlation volume. The first prior enforces the local consistency of matches in this volume via a pyramidal structure iteratively constructed using a non-parametric module. The second prior exploits the fact that each descriptor should match with at most one descriptor from the other image. We combine our unsupervised loss with a standard self-supervised loss trained from synthetic image augmentations. Feature descriptors learned by the proposed approach outperform their fully- and self-supervised counterparts on various geometric benchmarks such as visual localization and image matching, achieving state-of-the-art performance. Project webpage: https://europe.naverlabs.com/research/3d-vision/pump.
科研通智能强力驱动
Strongly Powered by AbleSci AI