An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network

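Computer science; Pattern recognition (psychology); Artificial intelligence; Fusion; Feature (linguistics); Scale (ratio); Layer (electronics); Computer vision; Materials science; Physics; Linguistics; Philosophy; Quantum mechanics; Composite materials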
Authors
Zhong Qu, Le-yuan Gao, Shengye Wang, Haonan Yin, Tuming Yi
Source
Journal: Image and Vision Computing [Elsevier]
Volume: 125, article 104518; cited by: 9
Identifier
DOI: 10.1016/j.imavis.2022.104518
Abstract

SSD and YOLOv5 are representative one-stage object detectors. This paper proposes an improved one-stage detector based on YOLOv5, named the Multi-scale Feature Cross-layer Fusion Network (M-FCFN). First, shallow and deep features are extracted from the PANet structure and fused across layers, producing an output feature scale different from the existing 80 × 80, 40 × 40, and 20 × 20 outputs. Then, following the single shot multi-box detector (SSD), the cross-layer-fused features are reduced in dimension and used as a further prediction output. Two entirely new feature scales are therefore added as outputs. Features at different scales are needed to detect objects of different sizes, so the added outputs raise the probability of detecting an object and significantly improve detection accuracy. Finally, for the Autoanchor mechanism used by YOLOv5, we propose an EIOU k-means calculation. We compare against the four YOLOv5 model structures S, M, L, and X. Missed and false detections of large objects are reduced, giving better detection results. Experiments show that our method achieves 89.1% and 67.8% mAP@0.5 on the PASCAL VOC and MS COCO datasets, respectively. Compared with YOLOv5_S, it improves mAP@[0.5:0.95] by 4.4% and 1.4% on PASCAL VOC and MS COCO, respectively. Compared with the four YOLOv5 models, our method has better detection accuracy for large objects. Notably, its large-scale mAP@[0.5:0.95] on MS COCO is 5.4% higher than that of YOLOv5_S.

• We propose the Multi-scale Feature Cross-layer Fusion Network (M-FCFN).
• Two entirely new feature scales are added as outputs.
• We propose an EIOU k-means Autoanchor calculation.
• Missed and false detections of large objects are reduced.
• Our method's large-scale mAP@[0.5:0.95] is 5.4% higher than that of YOLOv5_S.
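
The abstract names an EIOU k-means calculation for anchor selection but does not define it, so the following is only an illustrative sketch and not the authors' implementation. It assumes that anchors are clustered as (width, height) pairs, that the two boxes being compared share a centre (so the centre-distance term of the EIoU loss vanishes), and that cluster centres are updated by a plain mean. The names `eiou_distance` and `kmeans_anchors` are hypothetical.

```python
import random


def eiou_distance(box, anchor):
    """EIoU-style distance between two (width, height) pairs.

    Assumption: both boxes share the same centre, so the centre-distance
    term of the EIoU loss is zero and only the IoU, width, and height
    terms remain.
    """
    w1, h1 = box
    w2, h2 = anchor
    inter = min(w1, w2) * min(h1, h2)
    union = w1 * h1 + w2 * h2 - inter
    iou = inter / union
    # The box enclosing two co-centred boxes has size (max w, max h).
    cw, ch = max(w1, w2), max(h1, h2)
    return 1.0 - iou + ((w1 - w2) / cw) ** 2 + ((h1 - h2) / ch) ** 2


def kmeans_anchors(boxes, k, iters=100, seed=0):
    """Cluster (w, h) pairs into k anchors using eiou_distance."""
    rng = random.Random(seed)
    anchors = rng.sample(boxes, k)  # initial centres drawn from the data
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for box in boxes:
            nearest = min(range(k), key=lambda i: eiou_distance(box, anchors[i]))
            clusters[nearest].append(box)
        new_anchors = []
        for i, members in enumerate(clusters):
            if not members:
                new_anchors.append(anchors[i])  # keep the anchor of an empty cluster
                continue
            mean_w = sum(w for w, _ in members) / len(members)
            mean_h = sum(h for _, h in members) / len(members)
            new_anchors.append((mean_w, mean_h))
        if new_anchors == anchors:
            break  # assignments have stopped changing
        anchors = new_anchors
    # Return anchors ordered from smallest to largest area.
    return sorted(anchors, key=lambda a: a[0] * a[1])
```

Called as `kmeans_anchors(list_of_wh_pairs, k)`, this returns `k` anchors sorted by area. Like any k-means variant, the result depends on the initial centres, so in practice one would likely run several seeds and keep the clustering with the lowest total distance.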