计算机科学
编码器
高光谱成像
人工智能
变压器
模式识别(心理学)
卷积神经网络
空间分析
安全性令牌
遥感
物理
量子力学
电压
地质学
操作系统
计算机安全
作者
Er Ouyang,Bin Li,Wenjing Hu,Guoyun Zhang,Lin Zhao,Jianhui Wu
标识
DOI:10.1109/tgrs.2023.3242978
摘要
The transformer framework has shown great potential in the field of hyperspectral image (HSI) classification due to its superior global modeling capabilities compared to convolutional neural networks (CNNs). To utilize the transformer to model spatial–spectral information, a hybrid transformer that integrates multigranularity tokens and spatial–spectral attention (SSA) is proposed. Specifically, a token generator is designed to embed the multigranularity semantic tokens, which contributes richer image features to the model by exploiting CNN’s local representation capability. Moreover, a transformer encoder with an SSA mechanism is proposed to capture the global dependencies between different tokens, enabling the model to focus on more differentiated channels and spatial locations to improve the classification accuracy. Ultimately, adaptive weighted fusion is applied to different granularity transformer branches to boost HybridFormer’s classification performance. Experiments were conducted on four new challenging datasets, and the results indicate that HybridFormer achieves state-of-the-art results in terms of classification performance. The code of this work will be available at https://github.com/zhaolin6/HybridFormer for the sake of reproducibility.
科研通智能强力驱动
Strongly Powered by AbleSci AI