计算机科学
人工智能
冗余(工程)
计算机视觉
块(置换群论)
依赖关系(UML)
计算复杂性理论
模式识别(心理学)
图像(数学)
图像处理
四叉树
二次方程
领域(数学)
编码(内存)
像素
图像压缩
缩小
块大小
图像复原
感受野
棱锥(几何)
变换编码
遗忘
缩放比例
空间相关性
安全性令牌
作者
Junbo Qiao,Jincheng Liao,Wei Li,Yulun Zhang,Yong Guo,Jiao Xie,Jie Hu,Shaohui Lin
标识
DOI:10.1109/tip.2025.3643146
摘要
Despite Transformers have achieved significant success in low-level vision tasks, they are constrained by computing self-attention with a quadratic complexity and limited-size windows. This limitation results in a lack of global receptive field across the entire image. Recently, State Space Models (SSMs) have gained widespread attention due to their global receptive field and linear complexity with respect to input length. However, integrating SSMs into low-level vision tasks presents two major challenges: 1) Relationship degradation of long-range tokens with a long-range forgetting problem by encoding pixel-by-pixel high-resolution images. 2) Significant redundancy in the existing multi-direction scanning strategy. To this end, we propose Hi-Mamba for image super-resolution (SR) to address these challenges, which unfolds the image with only a single scan. Specifically, the Global Hierarchical Mamba Block (GHMB) enables token interactions across the entire image, providing a global receptive field while leveraging a multi-scale structure to facilitate long-range dependency learning. Additionally, the Direction Alternation Module (DAM) adjusts the scanning patterns of GHMB across different layers to enhance spatial relationship modeling. Extensive experiments demonstrate that our Hi-Mamba achieves 0.2-0.27dB PSNR gains on the Urban100 dataset across different scaling factors compared to the state-of-the-art MambaIRv2 for SR. Moreover, our lightweight Hi-Mamba also outperforms lightweight SRFormer by 0.39dB PSNR for $\times 2$ SR.
科研通智能强力驱动
Strongly Powered by AbleSci AI