计算机科学
卷积(计算机科学)
人工智能
棱锥(几何)
特征(语言学)
推论
计算机视觉
编码(集合论)
算法
人工神经网络
数学
语言学
哲学
几何学
集合(抽象数据类型)
程序设计语言
作者
Pengfei Jiang,Xin Yang,Yuanjie Chen,Wenjie Song,Yang Li
标识
DOI:10.1016/j.cag.2023.08.014
摘要
Multi-View Stereo (MVS) is a crucial technique for reconstructing the geometric structure of a scene, given the known camera parameters. Previous deep learning-based MVS methods have mainly focused on improving the reconstruction quality but overlooked the running efficiency during the actual algorithm deployment. For example, deformable convolutions have been introduced to improve the accuracy of the reconstruction results further, however, its inability for parallel optimization caused low inference speed. In this paper, we propose AdaptMVSNet which is device-friendly and reconstruction-efficient, while preserving the original results. To this end, adaptive convolution is introduced to significantly improve the efficiency in speed and metrics compared to current methods. In addition, an attention fusion module is proposed to blend features from adaptive convolution and the feature pyramid network. Our experiments demonstrate that our proposed approach achieves state-of-the-art performance and is almost 2× faster than the recent fastest MVS method. We will release our source code.
科研通智能强力驱动
Strongly Powered by AbleSci AI