YOLOv10: Real-Time End-to-End Object Detection

端到端原则 计算机科学 死胡同 对象(语法) 人工智能 数学 几何学 流量(数学)
作者
Ao Wang,Hui Chen,Lihao Liu,Kai Chen,Zijia Lin,Jungong Han,Guiguang Ding
出处
期刊:Cornell University - arXiv 被引量:1020
标识
DOI:10.48550/arxiv.2405.14458
摘要

Over the past years, YOLOs have emerged as the predominant paradigm in the field of real-time object detection owing to their effective balance between computational cost and detection performance. Researchers have explored the architectural designs, optimization objectives, data augmentation strategies, and others for YOLOs, achieving notable progress. However, the reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs and adversely impacts the inference latency. Besides, the design of various components in YOLOs lacks the comprehensive and thorough inspection, resulting in noticeable computational redundancy and limiting the model's capability. It renders the suboptimal efficiency, along with considerable potential for performance improvements. In this work, we aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture. To this end, we first present the consistent dual assignments for NMS-free training of YOLOs, which brings competitive performance and low inference latency simultaneously. Moreover, we introduce the holistic efficiency-accuracy driven model design strategy for YOLOs. We comprehensively optimize various components of YOLOs from both efficiency and accuracy perspectives, which greatly reduces the computational overhead and enhances the capability. The outcome of our effort is a new generation of YOLO series for real-time end-to-end object detection, dubbed YOLOv10. Extensive experiments show that YOLOv10 achieves state-of-the-art performance and efficiency across various model scales. For example, our YOLOv10-S is 1.8$\times$ faster than RT-DETR-R18 under the similar AP on COCO, meanwhile enjoying 2.8$\times$ smaller number of parameters and FLOPs. Compared with YOLOv9-C, YOLOv10-B has 46\% less latency and 25\% fewer parameters for the same performance.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
月地花开完成签到,获得积分10
刚刚
vothuong完成签到,获得积分10
刚刚
大个应助dxxxxx采纳,获得30
1秒前
111完成签到,获得积分10
1秒前
3秒前
3秒前
小璐发布了新的文献求助10
4秒前
5秒前
羽言发布了新的文献求助10
5秒前
霸气蛋挞完成签到 ,获得积分10
5秒前
雨水发布了新的文献求助10
6秒前
W星球Y族人完成签到,获得积分10
7秒前
zz完成签到 ,获得积分10
8秒前
sy完成签到,获得积分10
8秒前
盲目逛恋发布了新的文献求助10
9秒前
公子渔发布了新的文献求助10
9秒前
10秒前
nhanvm完成签到,获得积分10
10秒前
10秒前
10秒前
甜美凝芙完成签到,获得积分10
11秒前
11秒前
小马甲应助满意血茗采纳,获得10
11秒前
Fighting发布了新的文献求助10
12秒前
13秒前
爆米花应助公子渔采纳,获得10
13秒前
13秒前
张丽妍发布了新的文献求助10
14秒前
14秒前
15秒前
JamesPei应助阔达的沛儿采纳,获得10
16秒前
16秒前
17秒前
沉默羔羊完成签到,获得积分10
17秒前
沈澜发布了新的文献求助10
17秒前
xiaxia发布了新的文献求助10
18秒前
sun完成签到,获得积分10
18秒前
烟花应助聪明的心语采纳,获得10
18秒前
LingMg发布了新的文献求助10
19秒前
不知道完成签到,获得积分10
19秒前
高分求助中
Psychopathic Traits and Quality of Prison Life 1000
Chemistry and Physics of Carbon Volume 18 800
The formation of Australian attitudes towards China, 1918-1941 660
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6451648
求助须知:如何正确求助?哪些是违规求助? 8263408
关于积分的说明 17608060
捐赠科研通 5516304
什么是DOI,文献DOI怎么找? 2903709
邀请新用户注册赠送积分活动 1880647
关于科研通互助平台的介绍 1722662