目标检测
人工智能
计算机视觉
计算机科学
障碍物
对象(语法)
Viola–Jones对象检测框架
卷积神经网络
行人检测
对象类检测
箱子
模式识别(心理学)
行人
工程类
人脸检测
面部识别系统
政治学
运输工程
法学
算法
作者
Helawe Behailu Erdaw,Yesuneh Getachew Taye,Dereje Teferi Lemma
标识
DOI:10.1109/ict4da59526.2023.10302240
摘要
Computer vision is one of the state-of-the-art technologies for object detection problems. Accurate detection of obstacles could assist blind and visually impaired people to n avigate safely while they walk. However, object detection and t racking are some of the challenging tasks in computer vision. I n video analysis, there are basic steps to be done: detection of target objects in consecutive frames, and analysis of objects to understand their behavior. Recently, several methods of object detection based convolutional neural networks have improved performance undue different conditions like speed and accurac y. However, most of these methods have slow recognition speed that limits their use in real-time situations. Recently, a unified object detection model You Only Look Once (YOLO) was prop osed, which could directly regress from input image to object cl ass scores and positions. In this paper, we applied YOLOv2 to our prepared datasets of three different classes namely pothole, garbage bin, and pole and we proposed a technique called Short-Term Memory, which considers information between every f rame, to reinforce the detection capability of YOLO in video st reaming by including obj ect location and size estimation tasks. Through this, we achieved a mean average precision of 60.17% accuracy with an average detection speed of 34.6 fps.
科研通智能强力驱动
Strongly Powered by AbleSci AI