MA-SAM: Modality-agnostic SAM adaptation for 3D medical image segmentation

分割计算机科学人工智能编码器计算机视觉医学影像学图像分割模态（人机交互）模式识别（心理学）操作系统

作者

Cheng Chen,Juzheng Miao,Dufan Wu,Aoxiao Zhong,Zhiling Yan,Sekeun Kim,Jiang Hu,Zhengliang Liu,Lichao Sun,Xiang Li,Tianming Liu,Pheng‐Ann Heng,Quanzheng Li

出处

期刊：Medical Image Analysis [Elsevier BV]
日期：2024-08-23 卷期号：98: 103310-103310 被引量：36

链接

arxiv.org arxiv.org nih.govdoi.org

标识

DOI：10.1016/j.media.2024.103310

摘要

The Segment Anything Model (SAM), a foundation model for general image segmentation, has demonstrated impressive zero-shot performance across numerous natural image segmentation tasks. However, SAM's performance significantly declines when applied to medical images, primarily due to the substantial disparity between natural and medical image domains. To effectively adapt SAM to medical images, it is important to incorporate critical third-dimensional information, i.e., volumetric or temporal knowledge, during fine-tuning. Simultaneously, we aim to harness SAM's pre-trained weights within its original 2D backbone to the fullest extent. In this paper, we introduce a modality-agnostic SAM adaptation framework, named as MA-SAM, that is applicable to various volumetric and video medical data. Our method roots in the parameter-efficient fine-tuning strategy to update only a small portion of weight increments while preserving the majority of SAM's pre-trained weights. By injecting a series of 3D adapters into the transformer blocks of the image encoder, our method enables the pre-trained 2D backbone to extract third-dimensional information from input data. We comprehensively evaluate our method on five medical image segmentation tasks, by using 11 public datasets across CT, MRI, and surgical video data. Remarkably, without using any prompt, our method consistently outperforms various state-of-the-art 3D approaches, surpassing nnU-Net by 0.9%, 2.6%, and 9.9% in Dice for CT multi-organ segmentation, MRI prostate segmentation, and surgical scene segmentation respectively. Our model also demonstrates strong generalization, and excels in challenging tumor segmentation when prompts are used. Our code is available at: https://github.com/cchen-cc/MA-SAM.

求助该文献

最长约 10秒，即可获得该文献文件

MA-SAM: Modality-agnostic SAM adaptation for 3D medical image segmentation

今日热心研友