高光谱成像
频域
人工智能
模式识别(心理学)
计算机科学
变压器
遥感
计算机视觉
工程类
地质学
电压
电气工程
作者
Cheng Shi,P. C. Y. Chen,Li Fang,Minghua Zhao,Xinhong Hei,Qiguang Miao
标识
DOI:10.1109/tim.2025.3578703
摘要
Recently, Transformer has achieved significant success in hyperspectral image (HSI) classification task. However, most Transformer and its variants focus more on spatial domain global feature learning, ignoring the complementary characteristics provided by frequency domain features. The Fast Fourier Transform (FFT), due to its sensitivity for frequency domain information, has become a primary tool for frequency domain analysis. However, different frequency bands are often assigned the same attention values, and the differences between different frequency bands are not considered. To fully explore and fusion spatial and frequency domain features, we propose a multi-scale spatial-frequency domain cross transformer (SFDCT-Former) network. We design a two-branch structure for spatial domain and frequency domain feature learning: one branch utilizes the multi-head self-attention (MHSA) module for spatial domain feature learning, while the other incorporates a Multi-Frequency Domain Transformer (MFre-Former) encoder for frequency domain feature learning. The MFre-Former encoder divides the frequency domain into non-overlapping frequency bands and assigns distinct attention to each frequency band, therefore, different frequency domain information can be captured more precisely. Furthermore, to fuse the spatial and frequency domain features, we design a Multi-Level Cross Attention (MLCA) fusion module. The MLCA module effectively combines spatial and frequency domain features at different levels to better capture their complementary characteristics. Extensive experiments conduct on four publicly available HSI datasets demonstrate that the proposed method outperforms nine state-of-the-art methods in classification performance. The code is available at https://github.com/AAAA-CS/SFDCT-Former.
科研通智能强力驱动
Strongly Powered by AbleSci AI