SwinFG: A fine-grained recognition scheme based on swin transformer

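Discriminant; Computer science; Artificial intelligence; Transformer; Pattern recognition (psychology); Computation; Algorithm; Physics; Quantum mechanics; Voltage
Authors
Zhipeng Ma, Xiaoyu Wu, Anzhuo Chu, Lei Huang, Zhiqiang Wei
Source
Journal: Expert Systems With Applications [Elsevier BV]
Volume/issue: 244: 123021-123021 Citations: 21
Identifier
DOI:10.1016/j.eswa.2023.123021
Abstract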

Fine-grained image recognition (FGIR) is a challenging task as it requires the recognition of sub-categories with subtle differences. Recently, the swin transformer has shown impressive performance in various fields. Our research has shown that swin transformer applied directly to FGIR is also highly effective compared to many other approaches and can be further enhanced with adaptive improvements. In this paper, we propose a novel swin transformer based architecture, named SwinFG, which enhances FGIR by leveraging shifted window based self-attention to locate discriminative regions. The self-attention computation fuses image patches together based on attention weights, enabling the subsequent influence of each patch to be tracked and its contribution to the extracted feature to be determined. This forms the basis for locating discriminative regions. To this end, we propose a series of transformations that integrate the attention weights of local windows in each block into attention maps, which can be recursively multiplied to track changes in the attention weights. As the discriminative regions are not entirely occupied by the foreground object, the background information is also expressed in the extracted feature inevitably. To address this, we propose conducting contrastive learning on features obtained from both the discriminative and background regions of a single image to enlarge their distance and further eliminate any potential influence from the background. We demonstrate the state-of-the-art performance of our model on four popular fine-grained benchmarks. (The code is available at https://anonymous.4open.science/r/swinFG-1DCE).
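
The abstract describes integrating per-window attention weights into full attention maps and multiplying those maps recursively to trace how much each patch contributes to the final feature. The following is only a toy sketch of that general idea, not the authors' implementation: it assumes a 1-D row of four patches, windows of two patches, a cyclic shift in the second block, and hand-picked attention weights. The helper names (`windows_to_map`, `rollout`, `patch_scores`) are hypothetical, and residual connections, multiple heads, and patch merging are ignored.

```python
def matmul(a, b):
    """Multiply two square matrices stored as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def windows_to_map(window_attns, window_indices, n):
    """Scatter per-window attention matrices into one n x n attention map.

    window_indices[w] lists the global patch indices covered by window w;
    entries for pairs of patches in different windows stay at zero.
    """
    full = [[0.0] * n for _ in range(n)]
    for attn, idx in zip(window_attns, window_indices):
        for r, i in enumerate(idx):
            for c, j in enumerate(idx):
                full[i][j] = attn[r][c]
    return full


def rollout(attention_maps):
    """Recursively multiply attention maps, earliest block first."""
    n = len(attention_maps[0])
    joint = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for attn in attention_maps:
        # A later block re-mixes the tokens produced by the earlier blocks.
        joint = matmul(attn, joint)
    return joint


def patch_scores(joint):
    """Average contribution of each input patch over all output tokens."""
    n = len(joint)
    return [sum(joint[i][j] for i in range(n)) / n for j in range(n)]


# Toy weights (assumed for illustration only).
uniform = [[0.5, 0.5], [0.5, 0.5]]
focused = [[0.9, 0.1], [0.9, 0.1]]  # both tokens attend mostly to the first patch

# Block 1: regular windows {0,1} and {2,3}; the second window focuses on patch 2.
block1 = windows_to_map([uniform, focused], [[0, 1], [2, 3]], 4)
# Block 2: windows shifted by one patch (cyclically): {1,2} and {3,0}.
block2 = windows_to_map([uniform, uniform], [[1, 2], [3, 0]], 4)

joint = rollout([block1, block2])
scores = patch_scores(joint)
best = max(range(len(scores)), key=scores.__getitem__)
print("patch scores:", scores)
print("most influential patch:", best)  # patch 2 under these toy weights
```

In this toy setting the shifted second block lets every output token see all four patches, and the accumulated map still singles out patch 2, which is the kind of signal that could be thresholded to separate discriminative from background regions.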