计算机科学
人工智能
卷积(计算机科学)
卷积神经网络
特征(语言学)
分割
判别式
图像分割
块(置换群论)
背景(考古学)
特征提取
模式识别(心理学)
增采样
自回归模型
推论
编码器
稳健性(进化)
云计算
八叉树
深度学习
核(代数)
特征学习
RGB颜色模型
数据挖掘
点云
计算机视觉
领域(数学分析)
可扩展性
机器学习
算法
空间分析
作者
Penghui Niu,Jiashuai She,Taotao Cai,Yajuan Zhang,Ping Zhang,Junhua Gu,Jianxin Li
标识
DOI:10.1109/tgrs.2026.3666092
摘要
Ground-based cloud image segmentation is a critical research domain for photovoltaic (PV) power forecasting. Current deep learning (DL) approaches primarily focus on encoder-decoder architectural refinements. However, existing methodologies exhibit several limitations: (1) they rely on dilated convolutions for multi-scale context extraction, yet fail to leverage inter-channel interoperability and partial feature efficacy; (2) implementations of attention-based feature enhancement frequently compromise the equilibrium between accuracy and throughput; and (3) the decoder modifications often fail to re-establish global interdependencies among hierarchical local features, thereby constraining inference efficiency. To mitigate these challenges, we propose MPCM-Net, a Multi-scale network that integrates Partial attention Convolutions with Mamba architectures to enhance segmentation accuracy. Specifically, the encoder incorporates a multi-scale partial attention convolution (MPAC), which comprises: (1) a multi-scale partial convolution block (MPC) with partial channel module (ParCM) and partial spatial module (ParSM) that facilitating global spatial interaction across multi-scale cloud formations, and (2) a multiscale partial attention block (MPA) combining partial attention module (ParAM) and ParSM to extract discriminative features with reduced computational complexity. On the decoder side, a multi-scale Mamba block (M2B) is employed to mitigate contextual loss through a spatial-semantic hybrid domain (SSHD) that maintains linear complexity while enabling deep feature aggregation across spatial and scale dimensions. Furthermore, we introduce and release a dataset incorporating Complex-Scale variations, Radiative properties, and Color attributes (CSRC), which is a clear-label, fine-grained segmentation benchmark designed to overcome the critical limitations of existing public datasets. Extensive empirical analysis on CSRC demonstrates the superior performance of MPCM-Net over state-of-the-art methods, achieving an optimal balance between segmentation accuracy and inference speed. The dataset and source code will be available at https://github.com/she1110/CSRC.
科研通智能强力驱动
Strongly Powered by AbleSci AI