计算机科学
人工智能
分割
卷积神经网络
尺度空间分割
图像分割
背景(考古学)
深度学习
模式识别(心理学)
卷积(计算机科学)
计算机视觉
基于分割的对象分类
图像分辨率
人工神经网络
生物
古生物学
作者
Hao Tang,Xuemei Liu,Kun Han,Xiaohui Xie,Xuming Chen,Qian Huang,Yong Liu,Shanlin Sun,Narisu Bai
标识
DOI:10.1109/wacv48630.2021.00098
摘要
Multi-organ segmentation is one of most successful applications of deep learning in medical image analysis. Deep convolutional neural nets (CNNs) have shown great promise in achieving clinically applicable image segmentation performance on CT or MRI images. State-of-the-art CNN segmentation models apply either 2D or 3D convolutions on input images, with pros and cons associated with each method: 2D convolution is fast, less memory-intensive but inadequate for extracting 3D contextual information from volumetric images, while the opposite is true for 3D convolution. To fit a 3D CNN model on CT or MRI images on commodity GPUs, one usually has to either downsample input images or use cropped local regions as inputs, which limits the utility of 3D models for multi-organ segmentation. In this work, we propose a new framework for combining 3D and 2D models, in which the segmentation is realized through high-resolution 2D convolutions, but guided by spatial contextual information extracted from a low-resolution 3D model. We implement a self-attention mechanism to control which 3D features should be used to guide 2D segmentation. Our model is light on memory usage but fully equipped to take 3D contextual information into account. Experiments on multiple organ segmentation datasets demonstrate that by taking advantage of both 2D and 3D models, our method is consistently outperforms existing 2D and 3D models in organ segmentation accuracy, while being able to directly take raw whole-volume image data as inputs.
科研通智能强力驱动
Strongly Powered by AbleSci AI