计算机科学
卷积神经网络
人工智能
计算
联营
背景(考古学)
匹配(统计)
比例(比率)
棱锥(几何)
特征(语言学)
特征提取
模式识别(心理学)
计算机视觉
姿势
计算复杂性理论
RGB颜色模型
算法
数学
古生物学
语言学
统计
物理
哲学
量子力学
生物
光学
作者
Yanbing Xue,Doudou Zhang,Leida Li,Shiyin Li,Yuxin Wang
标识
DOI:10.1016/j.imavis.2022.104510
摘要
In order to accurately estimate disparities in textureless and slim regions, spatial pyramid pooling and stacked 3D CNN, which can capture global context information, are widely used in state-of-the-art stereo matching algorithms. Unfortunately, the computational complexity and high memory consumption make these methods not friendly to real-time applications such as autonomous driving and augmented realities. In order to balance the real-time performance and accuracy, we design lightweight multi-scale convolutional neural network for real-time stereo matching. First, Lightweight multi-scale 2D and 3D CNN modules are proposed for feature extraction and initial disparity computation respectively. Both of above modules only run on a low resolution to further reduce the amount of calculation. Second, multi-scale RGB images guided network is utilized to refine the final disparity estimation. Experiments on several datasets show that the proposed algorithm can achieve competitive results with speed of 64fps on a NIVDIA 1080 GPU. • The proposed LMNet can balance time efficiency and matching accuracy. • Lightweight MS2D module can extract more global features with less computation. • Lightweight MS3D module yield roubust results especially for slim objects.
科研通智能强力驱动
Strongly Powered by AbleSci AI