人工智能
计算机科学
可解释性
单眼
深度图
地点
深度学习
GSM演进的增强数据速率
计算机视觉
模式识别(心理学)
图像(数学)
语言学
哲学
作者
Junfan Wang,Yi Chen,Zhekang Dong,Mingyu Gao,Huipin Lin,Qiheng Miao
标识
DOI:10.1016/j.knosys.2023.110301
摘要
Monocular depth estimation makes it possible for machines to perceive the real world. The prediction performance of the depth estimation network based on deep learning will be affected due to the depth of the deep network and the locality of convolution operations. The imitation of the biological visual system and its functional structure is becoming a research hotspot. In this paper, we study the interpretability relationship between the biological visual system and the monocular depth estimation network. By concretizing the attention mechanism in biological vision, we propose a monocular depth estimation network based on the self-attention mechanism, named SABV-Depth, which can improve prediction accuracy. Inspired by the biological visual interaction mechanism, we focus on the information transfer between each module of the network and improve the information retention ability, and enable the network to output a depth map with rich object information and detailed information. Further, a decoder module with an inner-connection is proposed to recover depth maps with sharp edge contours. Our method is experimentally validated on the KITTI dataset and NYU Depth V2 dataset. The results show that compared with other works, the proposed method improves prediction accuracy. Meanwhile, the depth map has more object information and detail information, and a better edge information processing effect.
科研通智能强力驱动
Strongly Powered by AbleSci AI