分割
计算机科学
变压器
对偶(语法数字)
高分辨率
遥感
人工智能
计算机视觉
地质学
工程类
电气工程
艺术
文学类
电压
作者
Xiaotong Zhu,Tzu Wei Peng,Jia Guo,Hao Wang,Taotao Cao
摘要
Abstract High‐resolution remote sensing images play an important role in geological surveys, disaster detection, and other fields. However, highly imbalanced ground target classes and easily confused small ground targets pose significant challenges to the semantic segmentation task. We propose IC‐TransUNet, a new dual branch model based on an encoder‐decoder structure that fully exploits the advantages of convolutional neural networks and transformers and considers both detailed information and semantic information capture. Specifically, a lightweight CSwin transformer and InceptionNeXt are used as the dual branch backbone of the model. To further improve the model performance, first, we designed the InceptionNeXt‐CSwin Transformer Fusion Module (ICFM) and Edge Enhancement Module (EEM) to guide the dual branch backbone to obtain features. Second, a detachable Spatial‐channel Attention Fusion Module (SCAFM) is designed to be flexibly inserted into multiple positions of the model. Finally, we designed a decoder with significant performance based on a global local transformer block, SCAFM, and a multilayer perceptron segmentation head. IC‐TransUNet achieved highly competitive performance in experiments on the Vaihingen and Potsdam datasets from the International Society for Photogrammetry and Remote Sensing.
科研通智能强力驱动
Strongly Powered by AbleSci AI