遥感
计算机科学
光学(聚焦)
卷积(计算机科学)
像素
编码器
计算机视觉
人工智能
特征提取
图像分辨率
人工神经网络
地理
操作系统
光学
物理
作者
Ling Dai,Guangyun Zhang,Rongting Zhang
标识
DOI:10.1109/tgrs.2023.3237561
摘要
Extracting roads from complex high-resolution remote sensing images to update road networks has become a recent research focus. How to apply the contextual spatial correlation and topological structure of the roads properly to improve the extraction accuracy becomes a challenge in the increasingly complex road environment. In this article, inspired by the prior knowledge of the road shape and the progress in deformable convolution, we proposed a road augmented deformable attention network (RADANet) to learn the long-range dependencies for specific road pixels. We developed a road augmentation module (RAM) to capture the semantic shape information of the road from four strip convolutions. Deformable attention module (DAM) combines the sparse sampling capability of deformable convolution with the spatial self-attention mechanism. The integration of RAM enables DAM to extract road features more specifically. Furthermore, RAM is placed behind the fourth stage of encoder, and DAM is placed between last four stages of encoder and decoder in RADANet to extract multiscale road semantic information. Comprehensive experiments on representative public datasets (DeepGlobe and CHN6-CUG road datasets) demonstrate that our RADANet achieves advanced results compared with the state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI