计算机科学
特征(语言学)
网(多面体)
分割
语义特征
遥感
人工智能
图像(数学)
模式识别(心理学)
计算机视觉
地质学
数学
哲学
语言学
几何学
作者
Gyutae Hwang,Jiwoo Jeong,Sang Jun Lee
出处
期刊:Remote Sensing
[Multidisciplinary Digital Publishing Institute]
日期:2024-09-03
卷期号:16 (17): 3278-3278
被引量:17
摘要
Advances in deep learning and computer vision techniques have made impacts in the field of remote sensing, enabling efficient data analysis for applications such as land cover classification and change detection. Convolutional neural networks (CNNs) and transformer architectures have been utilized in visual perception algorithms due to their effectiveness in analyzing local features and global context. In this paper, we propose a hybrid transformer architecture that consists of a CNN-based encoder and transformer-based decoder. We propose a feature adjustment module that refines the multiscale feature maps extracted from an EfficientNet backbone network. The adjusted feature maps are integrated into the transformer-based decoder to perform the semantic segmentation of the remote sensing images. This paper refers to the proposed encoder–decoder architecture as a semantic feature adjustment network (SFA-Net). To demonstrate the effectiveness of the SFA-Net, experiments were thoroughly conducted with four public benchmark datasets, including the UAVid, ISPRS Potsdam, ISPRS Vaihingen, and LoveDA datasets. The proposed model achieved state-of-the-art accuracy on the UAVid, ISPRS Vaihingen, and LoveDA datasets for the segmentation of the remote sensing images. On the ISPRS Potsdam dataset, our method achieved comparable accuracy to the latest model while reducing the number of trainable parameters from 113.8 M to 10.7 M.
科研通智能强力驱动
Strongly Powered by AbleSci AI