计算机科学
分割
图像分割
人工智能
计算机视觉
特征(语言学)
遥感
模式识别(心理学)
地质学
语言学
哲学
作者
Haoxue Zhang,Gang Xie,Linjuan Li,Xinlin Xie,Jinchang Ren
标识
DOI:10.1109/tgrs.2025.3535724
摘要
Convolutional Neural Networks (CNNs), transformers, and the hybrid methods have been significant application in remote sensing. However, existing methods are limited in effectively modeling frequency domain information, which affects their ability to capture detailed information. Therefore, we propose a frequency-domain guided feature coupled mechanism and a global-local feature integration method (FGNet) for semantic segmentation. Specifically, a frequency-domain guided Swin transformer (FGSwin) is designed by introducing dilation group convolution, Fast Fourier Transform (FFT) and learnable weights to enhance the expression capability of frequency-domain and space-domain, local and global features, simultaneously. In addition, a global-local feature integration module (GLFI) is proposed for aggregating features to further enhance the discrimination of each category. Comprehensive experimental results demonstrate that, compared to existing methods, the proposed method achieves superior performance in terms of mean intersection over union (mIoU), reaching 71.46% and 74.04% on the ISPRS Potsdam and Vaihingen, two widely used datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI