计算机科学
保险丝(电气)
分割
传感器融合
人工智能
融合
变压器
RGB颜色模型
计算机视觉
图像分辨率
遥感
工程类
电压
哲学
地质学
电气工程
语言学
作者
Xianping Ma,Xiaokang Zhang,Man-On Pun,Ming Liu
标识
DOI:10.1109/igarss46834.2022.9883789
摘要
This work proposes a Multi-Stage Fusion Network (MSFNet) for semantic segmentation of fine-resolution remote sensing data by exploiting a multi -stage transformer architec-ture. The proposed MSFNet fuses information of different scales and modalities using a multi-stage scheme based on cross-attention mechanism. More specifically, the proposed MSFNet is composed of two Multi-Level Transformers (ML-Trans), one Crossmodal Fusion Transformer (CFTrans) and one Global-Context Augmented Transformer (GCATrans). DMLTrans and CFTrans are designed to fuse features in dif-ferent levels in each modality and high-level crossmodal ab-stract features, respectively, whereas GCATrans enhances the fusion feature of the main modal. Capitalizing on MSFNet, this work demonstrates the fusion of red-green-blue (RGB) remote sensing images and digital surface model (DSM) data. Extensive experiments on large-scale fine-resolution remote sensing data sets, namely the ISPRS Vaihingen, confirm the excellent performance of the proposed architecture as compared to conventional multimodal methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI