探测器
计算机科学
级联
人工智能
目标检测
对象(语法)
卷积神经网络
计算机视觉
模式识别(心理学)
电信
工程类
化学工程
作者
Chien-Yao Wang,Alexey Bochkovskiy,Hong-Yuan Mark Liao
出处
期刊:Cornell University - arXiv
日期:2022-01-01
被引量:382
标识
DOI:10.48550/arxiv.2207.02696
摘要
YOLOv7 surpasses all known object detectors in both speed and accuracy in the range from 5 FPS to 160 FPS and has the highest accuracy 56.8% AP among all known real-time object detectors with 30 FPS or higher on GPU V100. YOLOv7-E6 object detector (56 FPS V100, 55.9% AP) outperforms both transformer-based detector SWIN-L Cascade-Mask R-CNN (9.2 FPS A100, 53.9% AP) by 509% in speed and 2% in accuracy, and convolutional-based detector ConvNeXt-XL Cascade-Mask R-CNN (8.6 FPS A100, 55.2% AP) by 551% in speed and 0.7% AP in accuracy, as well as YOLOv7 outperforms: YOLOR, YOLOX, Scaled-YOLOv4, YOLOv5, DETR, Deformable DETR, DINO-5scale-R50, ViT-Adapter-B and many other object detectors in speed and accuracy. Moreover, we train YOLOv7 only on MS COCO dataset from scratch without using any other datasets or pre-trained weights. Source code is released in https://github.com/WongKinYiu/yolov7.
科研通智能强力驱动
Strongly Powered by AbleSci AI