计算机科学
特征(语言学)
比例(比率)
人工智能
噪音(视频)
突出
模式识别(心理学)
GSM演进的增强数据速率
融合
图像融合
计算机视觉
数据挖掘
图像(数学)
物理
哲学
量子力学
语言学
作者
Lamei Di,Bin Zhang,Yiming Wang
标识
DOI:10.1109/tgrs.2024.3403268
摘要
Recent advancements have significantly benefited in salient object detection for optical remote sensing images (ORSI-SOD). Given the varying spatial resolutions and complex scenes characteristic of optical remote sensing images (ORSI), leveraging and integrating features across scales is vital. However, excessive feature integration can introduce significant noise and result in inaccurate saliency mapping. To address this issue, we propose the Multi-scale and Multi-dimensional Weighted Network for ORSI-SOD (WeightNet). The network adopts a two-stage design where the first stage generates multi-scale weighted information, and the second stage conducts indirect multi-scale feature weighted fusion. This design skillfully avoids the lack of scale adaptation and noise interference that may arise from direct multi-scale feature fusion. Furthermore, to enhance feature fusion and target localization precision, we introduce the Multi-Scale Weighted Feature Aggregation Module (MWFAM) and the Multi-Dimensional Feature Guidance Module (MDFGM). MWFAM facilitates multi-scale feature fusion while minimizing noise from cross-layer interactions. MDFGM specializes in precise target localization and enhancement of detail and edge information. Additionally, the introduction of a Multi-Scale Parallel Decoder (MPD) significantly boosts the decoder's capability in identifying targets across various scales. Extensive qualitative and quantitative evaluations on three public ORSI datasets demonstrate the effectiveness and superiority of WeightNet over contemporary state-of-the-art models.
科研通智能强力驱动
Strongly Powered by AbleSci AI