计算机科学
人工智能
目标检测
深度学习
卷积神经网络
变压器
管道(软件)
人工神经网络
机器学习
对象(语法)
模式识别(心理学)
计算机视觉
量子力学
物理
电压
程序设计语言
作者
Yibo Sun,Zhe Sun,Weitong Chen
标识
DOI:10.1016/j.engappai.2024.108458
摘要
Object detection is one of the most important domains in computer vision tasks, which is an important branch of artificial intelligence. It aims at finding and locating the accurate position of objects in given pictures or videos. With the development of deep learning techniques, more powerful and robust algorithms have emerged to deal with multi-scale, high-level features to overcome the limitations of traditional pipeline of object detectors. The popularity of transformer framework enables larger capacity datasets by processing self-attention mechanism, and the object detection methods have evolved into a new era. This paper first reviews traditional object detection pipeline and brief history of deep learning, afterwards it focuses on the classification of deep learning-based object detection methods covering Convolution Neural Network based and transformer-based methods. Commonly used datasets and metrics are also covered in the next part. The Convolution Neural Network based methods mainly contain two-stage and one-stage detectors, Convolution Neural Network is the underlying structure of these methods convolutional stages are fundamental parts. Transformer-based models convert traditional object detection issues into end-to-end detection, which is widely used in dealing with images. Finally, the promising future of object detection areas are listed to show guidance on future work.
科研通智能强力驱动
Strongly Powered by AbleSci AI