CAAT: Image super-resolution algorithm via channel attention and transformer

计算机科学 判别式 变压器 算法 人工智能 卷积神经网络 模式识别(心理学) 特征提取 电子工程 失败 频道(广播) 堆积 自回归模型 图像质量
作者
Yuantao Chen,Liuhan Chen,Runlong Xia,Kai Yang,Ke Zou
出处
期刊:Array [Elsevier BV]
卷期号:28: 100628-100628 被引量:10
标识
DOI:10.1016/j.array.2025.100628
摘要

Deep learning-based single image super-resolution (SISR) has achieved remarkable progress, yet the trade-off between reconstruction quality and computational efficiency remains a critical challenge for real-time applications. This paper proposes a novel Channel Attention and Transformer framework (CAAT) that synergistically integrates convolutional operations with Swin Transformer blocks to achieve lightweight yet high-performance SR. The core innovation lies in the Channel-Attention-Embedded Transformer Block, which adaptively injects channel attention mechanisms into both Transformer self-attention and convolutional feature streams, enabling discriminative feature selection and cross-modal fusion at the block level. By alternately stacking convolution and Transformer layers with channel-wise adaptive weighting, proposed leverages their complementary strengths in local detail preservation and global context modeling while maintaining model compactness. Extensive evaluations on five benchmark datasets across three scales demonstrate that proposed achieves superior performance over six state-of-the-art methods. Notably, at × 4 magnification, proposed attains 0.09 dB PSNR improvement on Urban100 and 0.30 dB on Manga109 compared to the best counterparts, while reducing parameters by 51 % versus SwinIR and FLOPs by 68 % (195.6G vs. 612.6G for 1280 × 720 input). These results, substantiated by statistical significance tests and ablation studies, confirm proposed efficacy as a cost-effective solution for real-time SR deployments. • The proposed method had fused channel attention and transformer module. • The module designed for four stages and related operations. • The adaptive discriminative enhancement strategy has adopted. • The output features of different layers and filters are combined with channel attention.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
科研狗应助Lzt采纳,获得50
刚刚
baner发布了新的文献求助10
刚刚
阔达书雪发布了新的文献求助10
刚刚
哈哈哈发布了新的文献求助10
1秒前
1秒前
1秒前
2秒前
小分队发布了新的文献求助10
2秒前
2秒前
12发布了新的文献求助10
3秒前
ghmghm9910发布了新的文献求助10
3秒前
SJD完成签到,获得积分0
4秒前
饱满以云完成签到,获得积分10
4秒前
Sky完成签到,获得积分10
4秒前
zz完成签到,获得积分10
4秒前
清脆芮完成签到,获得积分10
4秒前
kong完成签到,获得积分10
5秒前
6秒前
vadfdfb发布了新的文献求助10
7秒前
7秒前
哈哈哈完成签到,获得积分20
7秒前
7秒前
8秒前
科研通AI6.2应助白昼流星采纳,获得10
8秒前
8秒前
科研通AI6.2应助baner采纳,获得10
8秒前
科研通AI6.4应助aaaaaa采纳,获得10
8秒前
Xj发布了新的文献求助10
8秒前
李小强完成签到,获得积分10
9秒前
hzl完成签到,获得积分10
9秒前
9秒前
面包完成签到,获得积分10
10秒前
乐乐应助牛油果采纳,获得10
10秒前
无事发生发布了新的文献求助10
10秒前
七安完成签到 ,获得积分10
10秒前
10秒前
10秒前
ghmghm9910完成签到,获得积分10
10秒前
风清扬应助科研通管家采纳,获得30
10秒前
10秒前
高分求助中
Annie Ernaux: De la perte au corps glorieux 600
类器官构建与应用:从基础到前沿 500
Petrology and Plate Tectonics,2025 500
Optical Coating Design with the Essential Macleod 400
A revision of Limenitis helmanni and its related species (Nymphalidae) from Central and South China 400
Moore's Clinically Oriented Anatomy 10th Edition 400
Direct and Iterative Linear System Solvers 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6786093
求助须知:如何正确求助?哪些是违规求助? 8507955
关于积分的说明 18120130
捐赠科研通 6092441
什么是DOI,文献DOI怎么找? 3020232
邀请新用户注册赠送积分活动 1997129
关于科研通互助平台的介绍 1984075