计算机科学
变压器
帕斯卡(单位)
目标检测
分类器(UML)
瓶颈
卷积神经网络
人工智能
建筑
特征提取
模式识别(心理学)
计算机工程
机器学习
嵌入式系统
工程类
程序设计语言
艺术
视觉艺术
电压
电气工程
作者
Xin Xie,Dengquan Wu,Mingye Xie,Zixi Li
标识
DOI:10.1016/j.patcog.2023.110172
摘要
The lightweight network model has gradually evolved into an important research direction in object detection. Network lightweight design has a variety of research methods, such as quantization, knowledge distillation, and neural architecture search. However, these methods either fail to break through the performance bottleneck of the model itself or require massive training costs. In order to solve these problems, a new object detection model based on CNN-Transformer hybrid feature extraction network called GhostFormer is proposed from the perspective of lightweight network structure design. GhostFormer makes full use of the advantages of local modeling of CNN and global modeling of Transformer, not only effectively reducing the complexity of the convolution model but also breaking through the limitation of Transformer's lack of inductive bias. Finally, better transfer results are obtained in downstream tasks. Experiments show that the model is less than half as computationally expensive as YOLOv7 on the Pascal VOC dataset, with only about 3 % [email protected] loss, and 9.7% [email protected]:0.95 improvement on the MS COCO dataset compared with GhostNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI