棱锥(几何)
分割
计算机科学
图像(数学)
人工智能
医学知识
图像分割
计算机视觉
业务
医学
数学
医学教育
几何学
作者
Xianjun Han,Tiantian Li,Hongyu Yang
摘要
Medical image segmentation is crucial for accurate diagnoses, and while convolutional neural network (CNN)-based methods have made strides in recent years, they struggle with modeling long-range dependencies. Transformer-based methods improve this aspect but require more computational resources. The Segment Anything Model (SAM) can generate pixel-level segmentation in natural images using sparse manual prompts, but it performs poorly on low-contrast, noisy ultrasound images. To address this issue, we propose a new medical image segmentation network architecture that integrates the transformer components, CNN modules, and SAM encoder into a unified framework. This allows us to capture both long-range dependencies and local features simultaneously. Additionally, we incorporate the extracted image features from the SAM model as prior knowledge to further improve segmentation accuracy with limited training data. To reduce computational stress, we employ the axial attention mechanism to approximate the transformer's effects by expanding the receptive field. Instead of replacing transformer components with lightweight attention modules, our model is divided into a global branch and a local branch. The global branch extracts context features with the transformer components, while the local branch processes patch tokens with the axial attention mechanism. We also construct an image pyramid to excavate internal statistics and multiscale representations to obtain more accurate segmentation regions. This bi-branch pyramid transformer (Bi-BPT) architecture is effective and robust for medical image segmentation, surpassing other related segmentation network architectures. The experimental results on various medical image datasets demonstrate its effectiveness.
科研通智能强力驱动
Strongly Powered by AbleSci AI