计算机科学
遥感
概化理论
特征提取
语义学(计算机科学)
人工智能
计算机视觉
光学(聚焦)
像素
图像(数学)
任务(项目管理)
特征(语言学)
图像分辨率
上下文图像分类
钥匙(锁)
遥感应用
语义计算
航程(航空)
基于知识的系统
信息抽取
语义记忆
情报检索
传输(电信)
语义映射
背景(考古学)
知识抽取
语义网格
数据挖掘
语义网络
稳健性(进化)
作者
Bowen Chen,Keyan Chen,Mohan Yang,Zhengxia Zou,Zhenwei Shi
标识
DOI:10.1109/tgrs.2025.3612420
摘要
High-resolution (HR) remote sensing imagery plays a vital role in a wide range of applications, including urban planning and environmental monitoring. However, due to limitations in sensors and data transmission links, the images acquired in practice often suffer from resolution degradation. Remote Sensing Image Super-Resolution (RSISR) aims to reconstruct HR images from low-resolution (LR) inputs, providing a cost-effective and efficient alternative to direct HR image acquisition. Existing RSISR methods primarily focus on low-level characteristics in pixel space, while neglecting the high-level understanding of remote sensing scenes. This may lead to semantically inconsistent artifacts in the reconstructed results. Motivated by this observation, our work aims to explore the role of high-level semantic knowledge in improving RSISR performance. We propose a Semantic-Guided Super-Resolution framework, SeG-SR, which leverages Vision-Language Models (VLMs) to extract semantic knowledge from input images and uses it to guide the super resolution (SR) process. Specifically, we first design a Semantic Feature Extraction Module (SFEM) that utilizes a pretrained VLM to extract semantic knowledge from remote sensing images. Next, we propose a Semantic Localization Module (SLM), which derives a series of semantic guidance from the extracted semantic knowledge. Finally, we develop a Learnable Modulation Module (LMM) that uses semantic guidance to modulate the features extracted by the SR network, effectively incorporating high-level scene understanding into the SR pipeline. We validate the effectiveness and generalizability of SeG-SR through extensive experiments: SeG-SR achieves state-of-the-art performance on three datasets, and consistently improves performance across various SR architectures. Notably, for the ×4 SR task on the UCMerced dataset, it attained a PSNR of 29.3042 dB and an SSIM of 0.7961. Codes can be found at https://github.com/Mr-Bamboo/SeG-SR.
科研通智能强力驱动
Strongly Powered by AbleSci AI