计算机科学
人工智能
联营
级联
特征提取
偏移量(计算机科学)
粒度
模式识别(心理学)
卷积神经网络
建筑
地理空间分析
数据挖掘
编码器
棱锥(几何)
计算机视觉
遥感
艺术
化学
物理
视觉艺术
色谱法
光学
程序设计语言
操作系统
地质学
作者
Sixian Chan,Yuan Wang,Yanjing Lei,Xu Cheng,Zhaomin Chen,Wei Wu
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing
[Institute of Electrical and Electronics Engineers]
日期:2023-01-01
卷期号:61: 1-18
被引量:4
标识
DOI:10.1109/tgrs.2023.3306018
摘要
The U-Net-like model has been widely studied in the field of building extraction. However, most of these models are based on locally sensed Convolutional Neural Networks(CNNs) designed with symmetric structure and single feature processing, which cannot accurately identify buildings with different sizes, shapes, and colors in remote sensing images. To overcome these problems, we propose the asymmetric cascade fusion network(ACFN), based on the Vision Transformer(ViT), to design a novel asymmetric architecture to recognize buildings of different sizes and shapes by processing multi-granularity features by different means. First, the asymmetric architecture obtains multi-granularity features with global contextual information by embedding different types of attention in encoder-decoders of different sizes. This architecture can identify densely distributed and occluded buildings by semantic reasoning in remote sensing images with complex information. Second, we design a multi-branch weighted pyramid pooling module, which sets different branch weights to offset the background noise introduced in introducing global contextual information. Our ACFN significantly improves the Beijing buildings, ISPRS-Vaihingen, and LoveDA datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI