变压器
计算机科学
特征提取
人工智能
模式识别(心理学)
工程类
电气工程
电压
作者
Wenfeng Zheng,Siyu Lu,Youshuai Yang,Zhengtong Yin,Lirong Yin
出处
期刊:PeerJ
[PeerJ, Inc.]
日期:2024-01-31
卷期号:10: e1755-e1755
被引量:118
标识
DOI:10.7717/peerj-cs.1755
摘要
In recent years, the image feature extraction method based on Transformer has become a research hotspot. However, when using Transformer for image feature extraction, the model's complexity increases quadratically with the number of tokens entered. The quadratic complexity prevents vision transformer-based backbone networks from modelling high-resolution images and is computationally expensive. To address this issue, this study proposes two approaches to speed up Transformer models. Firstly, the self-attention mechanism's quadratic complexity is reduced to linear, enhancing the model's internal processing speed. Next, a parameter-less lightweight pruning method is introduced, which adaptively samples input images to filter out unimportant tokens, effectively reducing irrelevant input. Finally, these two methods are combined to create an efficient attention mechanism. Experimental results demonstrate that the combined methods can reduce the computation of the original Transformer model by 30%-50%, while the efficient attention mechanism achieves an impressive 60%-70% reduction in computation.
科研通智能强力驱动
Strongly Powered by AbleSci AI