单眼
人工智能
计算机科学
计算机视觉
杠杆(统计)
激光雷达
深度知觉
单目视觉
深度学习
无人机
深度图
感知
图像(数学)
地理
遥感
生物
神经科学
遗传学
作者
Jeevana Jyothi Taviti,Dheeraj Mekala,Bhargava Rama Chilukuri
出处
期刊:Research Square - Research Square
日期:2024-11-21
标识
DOI:10.21203/rs.3.rs-5374429/v1
摘要
Abstract Depth estimation is crucial for computer vision applications like autonomous driving. While traditional methods such as LiDAR and radar are expensive, making monocular depth estimation a more cost-efficient alternative. However, deriving accurate depth from a single image is challenging due to its under-constrained nature. Monocular cues like perspective, scaling, and occlusion aid human depth perception, which deep learning-based models leverage to map image features to depth values. This research addresses the complexities of monocular depth estimation in mixed traffic conditions commonly found on Indian roads, with diverse vehicle classes, road surfaces, and unpredictable obstacles. Traditional methods often struggle in these scenarios. To overcome this, our study integrates object detection with deep learning models to estimate vehicle distances from frontal camera views. Validated using dashcam and drone footage, the proposed approach achieves an RMSE below 4 meters for both training and testing datasets. Moreover, the ensemble models reduced RMSE by up to 60% and improved the \(\textnormal{R}^\textnormal{2}\) value by 40%. This solution significantly enhances the spatial awareness of autonomous vehicles, providing a robust means of navigating heterogeneous traffic environments.
科研通智能强力驱动
Strongly Powered by AbleSci AI