计算机科学
人工智能
单眼
背景(考古学)
模棱两可
人工神经网络
特征(语言学)
计算机视觉
滤波器(信号处理)
卷积神经网络
古生物学
语言学
哲学
生物
程序设计语言
作者
Shubhra Aich,Jean Marie Uwabeza Vianney,Md Amirul Islam,Mannat Kaur Bingbing Liu
出处
期刊:International Conference on Robotics and Automation
日期:2021-05-30
卷期号:: 11746-11752
被引量:56
标识
DOI:10.1109/icra48506.2021.9560885
摘要
In this paper, we propose a Bidirectional Attention Network (BANet), an end-to-end framework for monocular depth estimation (MDE) that addresses the limitation of effectively integrating local and global information in convolutional neural networks. The structure of this mechanism derives from a strong conceptual foundation of neural machine translation, and presents a light-weight mechanism for adaptive control of computation similar to the dynamic nature of recurrent neural networks. We introduce bidirectional attention modules that utilize the feed-forward feature maps and incorporate the global context to filter out ambiguity. Extensive experiments reveal the high degree of capability of this bidirectional attention model over feed-forward baselines and other state-of-the-art methods for monocular depth estimation on two challenging datasets - KITTI and DIODE. We show that our proposed approach either outperforms or performs at least on a par with the state-of-the-art monocular depth estimation methods with less memory and computational complexity.
科研通智能强力驱动
Strongly Powered by AbleSci AI