单眼
人工智能
计算机科学
计算机视觉
投影(关系代数)
图像(数学)
钥匙(锁)
构造(python库)
中心(范畴论)
姿势
单目视觉
算法
结晶学
计算机安全
化学
程序设计语言
作者
Junning Zhang,Qunxing Su,Cheng Wang,Hong-qiang Gu
出处
期刊:Neurocomputing
[Elsevier BV]
日期:2020-05-01
卷期号:403: 182-192
被引量:45
标识
DOI:10.1016/j.neucom.2020.03.076
摘要
Abstract We propose MCK-NET, a novel monocular framework for 3D target detection, location and pose estimation in autonomous driving scenarios, in which the key challenge is how to effectively mine depth information from a monocular image. To tackle this ill-posed problem, we combine the relative instance-depth of multiple corners in a monocular image to explicitly construct the corresponding depth relations between interest regions, from which MCK-NET learns to detect and locate objects based on geometric reasoning. In addition, there are two significant features existing in MCK-NET: One is to use the relative relationship between the 2D center and the 3D center projection to help locate the 3D center. The other is that the geometric constraints are established, including semantic keypoints, 2D box center and the projection of 3D center, which can enhance the 3D corner detection and improve the estimation accuracy of 3D centers. Experiments on the KITTI datasets show that MCK-NET achieves the most advanced results in all three tasks and outperforms the current monocular methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI