图像分割
人工智能
分割
计算机科学
医学影像学
计算机视觉
图像(数学)
动力学(音乐)
尺度空间分割
模式识别(心理学)
心理学
教育学
作者
Shengjie Zhang,Xin Shen,Xiang Chen,Ziqi Yu,Bohan Ren,Haibo Yang,Xiao‐Yong Zhang,Yuan Zhou
标识
DOI:10.1109/tmi.2024.3477555
摘要
Prevalent studies on deep learning-based 3D medical image segmentation capture the continuous variation across 2D slices mainly via convolution, Transformer, inter-slice interaction, and time series models. In this work, via modeling this variation by an ordinary differential equation (ODE), we propose a cross instance query-guided Transformer architecture (CQformer) that leverages features from preceding slices to improve the segmentation performance of subsequent slices. Its key components include a cross-attention mechanism in an ODE formulation, which bridges the features of contiguous 2D slices of the 3D volumetric data. In addition, a regression head is employed to shorten the gap between the bottleneck and the prediction layer. Extensive experiments on 7 datasets with various modalities (CT, MRI) and tasks (organ, tissue, and lesion) demonstrate that CQformer outperforms previous state-of-the-art segmentation algorithms on 6 datasets by 0.44%-2.45%, and achieves the second highest performance of 88.30% on the BTCV dataset. The code will be publicly available after acceptance.
科研通智能强力驱动
Strongly Powered by AbleSci AI