Range (aeronautics)
Segmentation
Image (mathematics)
Computer science
Computer vision
Artificial intelligence
Engineering
Aerospace engineering
Authors
Juntao Jiang,Jiangning Zhang,W.J. Liu,Ming Gao,Xiaobin Hu,Xiaoxiao Yan,Feiyue Huang,Yong Liu
Source
Journal: Cornell University - arXiv
Date: 2025-01-14
Citations: 3
Identifier
DOI: 10.48550/arxiv.2501.08458
Abstract
In recent years, significant advancements have been made in deep learning for medical image segmentation, particularly with convolutional neural networks (CNNs) and transformer models. However, CNNs face limitations in capturing long-range dependencies, while transformers suffer from high computational complexity. To address this, we propose RWKV-UNet, a novel model that integrates the RWKV (Receptance Weighted Key Value) structure into the U-Net architecture. This integration enhances the model's ability to capture long-range dependencies and to improve contextual understanding, which is crucial for accurate medical image segmentation. We build a strong encoder with developed Global-Local Spatial Perception (GLSP) blocks combining CNNs and RWKVs. We also propose a Cross-Channel Mix (CCM) module to improve skip connections with multi-scale feature fusion, achieving global channel information integration. Experiments on 11 benchmark datasets show that the RWKV-UNet achieves state-of-the-art performance on various types of medical image segmentation tasks. Additionally, smaller variants, RWKV-UNet-S and RWKV-UNet-T, balance accuracy and computational efficiency, making them suitable for broader clinical applications.