MPCM-Net: A Multiscale Network That Integrates Partial Attention Convolution With Mamba for Ground-Based Cloud Image Segmentation

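Keywords: Computer science; Artificial intelligence; Convolution (computer science); Convolutional neural network; Feature (linguistics); Segmentation; Discriminative; Image segmentation; Block (permutation group theory); Context (archaeology); Feature extraction; Pattern recognition (psychology); Upsampling; Autoregressive model; Inference; Encoder; Robustness (evolution); Cloud computing; Octree; Deep learning; Kernel (algebra); Feature learning; RGB color model; Data mining; Point cloud; Computer vision; Domain (mathematical analysis); Scalability; Machine learning; Algorithm; Spatial analysis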

Authors

Penghui Niu,Jiashuai She,Taotao Cai,Yajuan Zhang,Ping Zhang,Junhua Gu,Jianxin Li

Source

Journal: IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
Date: 2026-01-01 Volume: 64 Pages: 1-16

Identifier

DOI: 10.1109/tgrs.2026.3666092

Abstract

Ground-based cloud image segmentation is a critical research domain for photovoltaic (PV) power forecasting. Current deep learning (DL) approaches primarily focus on encoder-decoder architectural refinements. However, existing methods exhibit several limitations: (1) they rely on dilated convolutions for multi-scale context extraction, yet fail to leverage inter-channel interoperability and partial feature efficacy; (2) attention-based feature enhancement frequently compromises the balance between accuracy and throughput; and (3) decoder modifications often fail to re-establish global interdependencies among hierarchical local features, thereby constraining inference efficiency. To mitigate these challenges, we propose MPCM-Net, a multi-scale network that integrates partial attention convolutions with Mamba architectures to enhance segmentation accuracy. Specifically, the encoder incorporates a multi-scale partial attention convolution (MPAC), which comprises: (1) a multi-scale partial convolution block (MPC) with a partial channel module (ParCM) and a partial spatial module (ParSM) that facilitate global spatial interaction across multi-scale cloud formations, and (2) a multi-scale partial attention block (MPA) that combines a partial attention module (ParAM) with ParSM to extract discriminative features at reduced computational complexity. On the decoder side, a multi-scale Mamba block (M2B) mitigates contextual loss through a spatial-semantic hybrid domain (SSHD) that maintains linear complexity while enabling deep feature aggregation across spatial and scale dimensions. Furthermore, we introduce and release CSRC, a dataset incorporating Complex-Scale variations, Radiative properties, and Color attributes; it is a clear-label, fine-grained segmentation benchmark designed to overcome the critical limitations of existing public datasets. Extensive empirical analysis on CSRC demonstrates the superior performance of MPCM-Net over state-of-the-art methods, achieving an optimal balance between segmentation accuracy and inference speed. The dataset and source code will be available at https://github.com/she1110/CSRC.
