感受野
计算机科学
算法
残余物
帕斯卡(单位)
集合(抽象数据类型)
领域(数学)
特征(语言学)
卷积神经网络
人工智能
模式识别(心理学)
数学
语言学
哲学
纯数学
程序设计语言
作者
Enhui Chai,Lin Ta,Zhanfei Ma,Min Zhi
标识
DOI:10.1016/j.imavis.2021.104317
摘要
Research shows that theoretical receptive field and effective receptive field are very important to target detection results. The effective receptive field determines the contribution of different positions in the theoretical receptive field. Therefore, the main purpose of this work is to increase the effective receptive field area and reduce the number of parameters. This idea obtains a high-precision and high-speed target detector. First, the algorithm needs to optimize the activation function to improve the efficiency of feature extraction. Second, the model structure needs to select the backbone network and improve the convolutional layer structure. Then, the enhanced network requires increasing the number of the residual structures and the “Concat” to improve feature extraction performance. Finally, the network needs to combine the optimized convolutional layer and the anchor box loss function to improve the performance of the anchor box. The project designed a YOLO algorithm (ERF-YOLO) with a larger effective receptive field. The training and testing of the experiment use PASCAL VOC data set and MS COCO data set respectively. Experimental results show that the parameter of ERF-YOLO is close to half of YOLO v4. In terms of detection accuracy, ERF-YOLO is superior to many current algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI