Transformer
Computer science
Image (mathematics)
Computer vision
Artificial intelligence
Electrical engineering
Engineering
Voltage
Authors
Wei Long,Xingyu Zhou,Leheng Zhang,Shuhang Gu
Source
Journal: Cornell University - arXiv
Date: 2025-03-26
Identifier
DOI: 10.48550/arxiv.2503.20337
Abstract
Transformer-based methods have achieved remarkable results in image super-resolution tasks because they can capture non-local dependencies in low-quality input images. However, this feature-intensive modeling approach is computationally expensive because, when obtaining attention weights, it computes similarities between the query and numerous features that are irrelevant to it. These unnecessary similarity calculations not only degrade the reconstruction performance but also introduce significant computational overhead. Accurately identifying the features that are important to the current query while avoiding similarity calculations with irrelevant features remains an open problem. To address this issue, we propose a novel and effective Progressive Focused Transformer (PFT) that links all isolated attention maps in the network through Progressive Focused Attention (PFA) to focus attention on the most important tokens. PFA not only enables the network to capture more critical similar features, but also significantly reduces the computational cost of the overall network by filtering out irrelevant features before computing similarities. Extensive experiments demonstrate the effectiveness of the proposed method, which achieves state-of-the-art performance on various single image super-resolution benchmarks.
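The abstract describes the mechanism only at a high level; the sketch below is a minimal, illustrative PyTorch rendering of the general idea it states (using a previous layer's attention map to keep, for each query, only a small set of candidate keys, and computing similarities over that reduced set). It is not the authors' implementation: the function name, the `top_k` parameter, and the way scores are passed between layers are assumptions made for illustration only.

```python
import torch

def progressive_focused_attention(q, k, v, prev_scores=None, top_k=64):
    """Illustrative sketch (not the paper's code) of attention that filters
    out irrelevant tokens before computing similarities.

    q, k, v: (batch, num_tokens, dim) query/key/value tensors.
    prev_scores: (batch, n_q, n_k) relevance map carried over from the
                 previous attention layer, or None for the first layer.
    top_k: number of key tokens kept per query (assumed hyperparameter).
    """
    b, n_q, d = q.shape
    n_k = k.shape[1]
    scale = d ** -0.5

    if prev_scores is None:
        # First layer: dense attention; record the full map for later layers.
        scores = (q @ k.transpose(-2, -1)) * scale           # (b, n_q, n_k)
        attn = scores.softmax(dim=-1)
        return attn @ v, attn

    # Pick, for every query, the top-k most relevant keys according to the
    # previous layer's map; similarities are only computed for this subset.
    k_eff = min(top_k, n_k)
    idx = prev_scores.topk(k_eff, dim=-1).indices            # (b, n_q, k_eff)

    # Gather the selected keys/values per query.
    idx_exp = idx.unsqueeze(-1).expand(-1, -1, -1, d)        # (b, n_q, k_eff, d)
    k_sel = k.unsqueeze(1).expand(-1, n_q, -1, -1).gather(2, idx_exp)
    v_sel = v.unsqueeze(1).expand(-1, n_q, -1, -1).gather(2, idx_exp)

    # Sparse similarity: each query attends only to its selected keys.
    scores = (q.unsqueeze(2) @ k_sel.transpose(-2, -1)).squeeze(2) * scale
    attn = scores.softmax(dim=-1)                            # (b, n_q, k_eff)
    out = (attn.unsqueeze(2) @ v_sel).squeeze(2)             # (b, n_q, d)

    # Scatter the sparse weights back into a dense map so the next layer
    # can narrow its own candidate set further.
    full_map = torch.zeros(b, n_q, n_k, device=q.device, dtype=attn.dtype)
    full_map.scatter_(-1, idx, attn)
    return out, full_map
```

In this reading, a subsequent attention layer would receive the returned dense map as its `prev_scores`, so the candidate set can shrink progressively with depth, which is one plausible way to realize the "linking of isolated attention maps" and the cost reduction the abstract claims.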