兰萨克
姿势
离群值
人工智能
匹配(统计)
计算机科学
特征(语言学)
三维姿态估计
模式识别(心理学)
可微函数
图形
特征匹配
计算机视觉
图像(数学)
数学
统计
理论计算机科学
哲学
数学分析
语言学
作者
Barbara Roessle,Matthias Nießner
出处
期刊:Cornell University - arXiv
日期:2022-05-03
被引量:1
标识
DOI:10.48550/arxiv.2205.01694
摘要
Erroneous feature matches have severe impact on subsequent camera pose estimation and often require additional, time-costly measures, like RANSAC, for outlier rejection. Our method tackles this challenge by addressing feature matching and pose optimization jointly. To this end, we propose a graph attention network to predict image correspondences along with confidence weights. The resulting matches serve as weighted constraints in a differentiable pose estimation. Training feature matching with gradients from pose optimization naturally learns to down-weight outliers and boosts pose estimation on image pairs compared to SuperGlue by 6.7% on ScanNet. At the same time, it reduces the pose estimation time by over 50% and renders RANSAC iterations unnecessary. Moreover, we integrate information from multiple views by spanning the graph across multiple frames to predict the matches all at once. Multi-view matching combined with end-to-end training improves the pose estimation metrics on Matterport3D by 18.5% compared to SuperGlue.
科研通智能强力驱动
Strongly Powered by AbleSci AI