编码(内存)
计算机科学
匹配(统计)
人工智能
背景(考古学)
水准点(测量)
计算机视觉
一般化
立体视觉
体积热力学
计算机立体视觉
立体摄像机
算法
几何学
数学
地理
物理
数学分析
统计
量子力学
考古
大地测量学
作者
Gangwei Xu,Xianqi Wang,Xiaohuan Ding,Xin Yang
标识
DOI:10.1109/cvpr52729.2023.02099
摘要
Recurrent All-Pairs Field Transforms (RAFT) has shown great potentials in matching tasks. However, all-pairs correlations lack non-local geometry knowledge and have difficulties tackling local ambiguities in ill-posed regions. In this paper, we propose Iterative Geometry Encoding Volume (IGEV-Stereo), a new deep network architecture for stereo matching. The proposed IGEV-Stereo builds a combined geometry encoding volume that encodes geometry and context information as well as local matching details, and iteratively indexes it to update the disparity map. To speed up the convergence, we exploit GEV to regress an accurate starting point for ConvGRUs iterations. Our IGEV-Stereo ranks 1st on KITTI 2015 and 2012 (Reflective) among all published methods and is the fastest among the top 10 methods. In addition, IGEV-Stereo has strong cross-dataset generalization as well as high inference efficiency. We also extend our IGEV to multi-view stereo (MVS), i.e. IGEV-MVS, which achieves competitive accuracy on DTU benchmark. Code is available at https://github.com/gangweiX/IGEV.
科研通智能强力驱动
Strongly Powered by AbleSci AI