计算机科学
增采样
分割
编码器
人工智能
变压器
图像分割
利用
源代码
特征(语言学)
计算机视觉
模式识别(心理学)
图像(数学)
物理
哲学
操作系统
量子力学
语言学
电压
计算机安全
作者
Wenxuan Wang,Chen Chen,Meng Ding,Hong Yu,Sen Zha,Jiangyun Li
标识
DOI:10.1007/978-3-030-87193-2_11
摘要
Transformer, which can benefit from global (long-range) information modeling using self-attention mechanisms, has been successful in natural language processing and 2D image classification recently. However, both local and global features are crucial for dense prediction tasks, especially for 3D medical image segmentation. In this paper, we for the first time exploit Transformer in 3D CNN for MRI Brain Tumor Segmentation and propose a novel network named TransBTS based on the encoder-decoder structure. To capture the local 3D context information, the encoder first utilizes 3D CNN to extract the volumetric spatial feature maps. Meanwhile, the feature maps are reformed elaborately for tokens that are fed into Transformer for global feature modeling. The decoder leverages the features embedded by Transformer and performs progressive upsampling to predict the detailed segmentation map. Extensive experimental results on both BraTS 2019 and 2020 datasets show that TransBTS achieves comparable or higher results than previous state-of-the-art 3D methods for brain tumor segmentation on 3D MRI scans. The source code is available at https://github.com/Wenxuan-1119/TransBTS.
科研通智能强力驱动
Strongly Powered by AbleSci AI