MSFT-YOLO: Improved YOLOv5 Based on Transformer for Detecting Defects of Steel Surface

目标检测 计算机科学 人工智能 探测器 领域(数学) 变压器 特征(语言学) 计算机视觉 工程类 模式识别(心理学) 电压 电气工程 哲学 电信 纯数学 语言学 数学
作者
Zexuan Guo,Chensheng Wang,Guang Yang,Zeyuan Huang,Guo Li
出处
期刊:Sensors [Multidisciplinary Digital Publishing Institute]
卷期号:22 (9): 3467-3467 被引量:200
标识
DOI:10.3390/s22093467
摘要

With the development of artificial intelligence technology and the popularity of intelligent production projects, intelligent inspection systems have gradually become a hot topic in the industrial field. As a fundamental problem in the field of computer vision, how to achieve object detection in the industry while taking into account the accuracy and real-time detection is an important challenge in the development of intelligent detection systems. The detection of defects on steel surfaces is an important application of object detection in the industry. Correct and fast detection of surface defects can greatly improve productivity and product quality. To this end, this paper introduces the MSFT-YOLO model, which is improved based on the one-stage detector. The MSFT-YOLO model is proposed for the industrial scenario in which the image background interference is great, the defect category is easily confused, the defect scale changes a great deal, and the detection results of small defects are poor. By adding the TRANS module, which is designed based on Transformer, to the backbone and detection headers, the features can be combined with global information. The fusion of features at different scales by combining multi-scale feature fusion structures enhances the dynamic adjustment of the detector to objects at different scales. To further improve the performance of MSFT-YOLO, we also introduce plenty of effective strategies, such as data augmentation and multi-step training methods. The test results on the NEU-DET dataset show that MSPF-YOLO can achieve real-time detection, and the average detection accuracy of MSFT-YOLO is 75.2, improving about 7% compared to the baseline model (YOLOv5) and 18% compared to Faster R-CNN, which is advantageous and inspiring.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
爆米花应助alooof采纳,获得10
1秒前
1秒前
1秒前
FashionBoy应助2499297293采纳,获得10
2秒前
wzx完成签到,获得积分10
3秒前
hana完成签到,获得积分10
4秒前
排骨炖汤完成签到,获得积分10
4秒前
4秒前
xzf1996完成签到,获得积分10
5秒前
得己完成签到 ,获得积分10
7秒前
Orange应助panda采纳,获得10
8秒前
那就发个呆完成签到,获得积分10
8秒前
Giaodv完成签到 ,获得积分20
8秒前
Qun发布了新的文献求助10
9秒前
李苗苗发布了新的文献求助10
9秒前
10秒前
arniu2008发布了新的文献求助10
10秒前
乐乐应助粗心的电源采纳,获得10
10秒前
11秒前
专一的茗发布了新的文献求助10
11秒前
你我山巅自相逢完成签到 ,获得积分10
12秒前
12秒前
77完成签到,获得积分10
12秒前
执着的香薇完成签到,获得积分10
12秒前
激动的55完成签到 ,获得积分10
12秒前
怕黑捕完成签到,获得积分10
14秒前
李健的小迷弟应助齐刘海采纳,获得10
14秒前
15秒前
露露发布了新的文献求助10
15秒前
tiantian发布了新的文献求助30
16秒前
ding应助脑壳疼采纳,获得10
16秒前
小白菜完成签到,获得积分10
16秒前
KEYANKANG完成签到,获得积分10
17秒前
恍若隔世完成签到,获得积分20
18秒前
19秒前
free应助ranitidine采纳,获得20
19秒前
19秒前
20秒前
22秒前
栗子完成签到 ,获得积分10
22秒前
高分求助中
Psychopathic Traits and Quality of Prison Life 1000
Malcolm Fraser : a biography 680
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6452988
求助须知:如何正确求助?哪些是违规求助? 8264588
关于积分的说明 17612294
捐赠科研通 5518381
什么是DOI,文献DOI怎么找? 2904263
邀请新用户注册赠送积分活动 1881074
关于科研通互助平台的介绍 1723455