计算机科学
特征提取
人工智能
卷积神经网络
模式识别(心理学)
编码器
感受野
特征(语言学)
领域(数学)
背景(考古学)
计算机视觉
数学
地质学
语言学
操作系统
哲学
古生物学
纯数学
作者
Yuanhao Zhao,Genyun Sun,Li Zhang,Aizhu Zhang,Xiuping Jia,Zheng Han
标识
DOI:10.1109/tgrs.2023.3282926
摘要
Extracting buildings from remote sensing images plays an important role in urban development planning, disaster assessment and mapping. Convolutional neural network (CNN) has been widely applied to building extraction because of its powerful deep semantic feature extraction ability. However, existing CNN-based building extraction methods are difficult to accurately extract multiscale buildings with accurate edges because of the limitation of feature receptive fields and the loss of spatial detail information. For the above problems, this paper proposes a multiscale receptive field network (MSRF-Net) to accurately extract multiscale buildings from remote sensing images. MSRF-Net includes multiscale receptive field feature encoder (MRFF-Encoder) and multipath decoder. In the MRFF-Encoder, a multiscale attentional down (MSAD) module and asymmetric residual inception (ARI) module are proposed to capture multiscale receptive field features. In the multipath decoder, convolutions with different kernel size and dilation are used in three parallel paths to learn localization-preserved multiscale features with multiscale receptive field. What's more, the features of different branches and MRFF-Encoder are fused by the proposed feature combination module, which contribute to capture context information of multiscale receptive field while recovering the resolution of feature space. The experimental results show that compared with the latest MAP-Net, MSRF-Net has achieved F1 score growth of 1.14%, 0.42%, 1.11% and IoU score growth of 1.68%, 0.76% and 1.64% respectively on Massachusetts data set, WHU data set and the Typical Cities Building data set.
科研通智能强力驱动
Strongly Powered by AbleSci AI