计算机科学
保险丝(电气)
骨干网
棱锥(几何)
目标检测
卷积神经网络
融合机制
融合
人工智能
探测器
过程(计算)
特征学习
特征(语言学)
模式识别(心理学)
块(置换群论)
频道(广播)
计算机视觉
特征提取
工程类
数学
计算机网络
哲学
电气工程
操作系统
脂质双层融合
电信
语言学
几何学
作者
Xiuling Zhang,Tingbo Wan,Ziyun Wu,Bingce Du
标识
DOI:10.1007/s10489-021-02545-6
摘要
YOLOv4-tiny is a simplified version of YOLOv4 detector, which is extremely fast and with few parameters. However, the detection performance of YOLOv4-tiny is poor while the recognition of small targets and occluded objects is weak. It is mainly attributed to the lack of feature extraction ability and learning ability of the backbone. Furthermore, the feature pyramid network (FPN) cannot adequately fuse adjacent feature maps in the process of multi-scale feature fusion. In this paper, a real-time detector with a bi-channel feature fusion mechanism is proposed based on YOLOv4-tiny, called BFF-YOLO, which effectively improves the detection of small targets and occluded objects. BFF-YOLO is composed of two main components: feature extraction and feature fusion. In the process of feature extraction, inspired by the idea of cross-stage partial connections (CSP), an enhanced CSP block (ECSPBlock) is proposed for enhancing the feature extraction of the backbone and the learning capability of the network. Moreover, the Maxpool layer in YOLOv4-tiny, which is used for downsampling and tends to lose fine-grained information, is replaced with a convolutional layer. In the process of feature fusion, a bi-channel feature fusion pyramid network (BFPN) is proposed to adequately fuse adjacent feature maps of different scales so that each detection head has both shallow and deep features. Finally, with a small increase in parameters, BFF-YOLO has achieved 36.5% AP and 85.1% mAP on the COCO and VOC datasets, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI