计算机科学
图像拼接
人工智能
矢量化(数学)
遥感
杠杆(统计)
分割
计算机视觉
块(置换群论)
图像分割
棱锥(几何)
语义映射
图像融合
约束(计算机辅助设计)
地理空间分析
稳健性(进化)
特征提取
人工神经网络
集合(抽象数据类型)
支持向量机
矢量地图
数据挖掘
水准点(测量)
特征向量
像素
边界(拓扑)
矢量量化
模式识别(心理学)
上下文图像分类
点(几何)
特征(语言学)
作者
Yansheng Li,Wanchun Li,Bo Dang,Yu Wang,Wei Chen,L D WANG,Bingnan Yang,Yongjun Zhang
标识
DOI:10.1109/tpami.2026.3660934
摘要
Large-size very-high-resolution (VHR) remote sensing imagery has emerged as a critical data source for high-precision vector mapping of multi-scale geographical elements such as building, water, road and etc. When dealing with the large-size image, due to the limited memory of GPU, the deep learning-based vector mapping methods often employ the sliding block strategy. This inevitably leads to the degenerated performance because of the stitching difficulty of the sliding blocks' vector mapping results. Therefore, it is necessary to conduct full-scope vector mapping via mining the consistent cue in large-size remote sensing imagery. To this end, this paper presents a novel global context-aware local point optimization method. To leverage the global context, this paper proposes a novel pyramid fusion network (PFNet) to conduct semantic segmentation of the large-size image in an end-to-end manner. Under the constraint of the global semantic segmentation result, a new inflection-point perception network (IPNet) is proposed to generate a set of stable points to depict the boundary of each element. Extensive experiments on building, water and road datasets, where each image has over 100 million pixels, show that our method obviously outperforms the existing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI