高光谱成像
计算机科学
人工智能
模式识别(心理学)
卷积神经网络
编码器
变压器
嵌入
空间分析
数学
物理
量子力学
电压
操作系统
统计
作者
Haoyang Yu,Zhen Xu,Ke Zheng,Danfeng Hong,Hao Yang,Ming Song
标识
DOI:10.1109/tgrs.2022.3186400
摘要
Convolutional neural networks (CNN) have been widely used in hyperspectral image classification (HSIC). Although the current CNN-based methods have achieved good performance, they still face a series of challenges. For example, the receptive field is limited, information is lost in down-sampling layer, and a lot of computing resources are consumed for deep networks. To overcome these problems, we proposed a multi-level spectral-spatial transformer network (MSTNet) for HSIC. The structure of MSTNet is an image-based classification framework, which is efficient and straightforward. Based on this framework, we designed a self-attentive encoder. Firstly, HSIs are processed into sequences. Meanwhile, a learned positional embedding is added to integrate spatial information. Then, a pure transformer encoder is employed to learn feature representations. Finally, the multi-level features are processed by decoders to generate the classification results in the original image size. The experimental results based on three real hyperspectral data sets demonstrate the efficiency of the proposed method in comparison with the other related CNN-based methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI