人工智能
计算机科学
分割
正电子发射断层摄影术
模式识别(心理学)
特征提取
一致性(知识库)
模态(人机交互)
特征(语言学)
放射科
医学
语言学
哲学
作者
Zhongliang Xue,Ping Li,Liang Zhang,Xiaoyuan Lu,Guangming Zhu,Peiyi Shen,Syed Afaq Ali Shah,Mohammed Bennamoun
标识
DOI:10.1109/tmi.2021.3089702
摘要
Liver lesion segmentation is an essential process to assist doctors in hepatocellular carcinoma diagnosis and treatment planning. Multi-modal positron emission tomography and computed tomography (PET-CT) scans are widely utilized due to their complementary feature information for this purpose. However, current methods ignore the interaction of information across the two modalities during feature extraction, omit the co-learning of the feature maps of different resolutions, and do not ensure that shallow and deep features complement each others sufficiently. In this paper, our proposed model can achieve feature interaction across multi-modal channels by sharing the down-sampling blocks between two encoding branches to eliminate misleading features. Furthermore, we combine feature maps of different resolutions to derive spatially varying fusion maps and enhance the lesions information. In addition, we introduce a similarity loss function for consistency constraint in case that predictions of separated refactoring branches for the same regions vary a lot. We evaluate our model for liver tumor segmentation using a PET-CT scans dataset, compare our method with the baseline techniques for multi-modal (multi-branches, multi-channels and cascaded networks) and then demonstrate that our method has a significantly higher accuracy ( ${p} < 0.05$ ) than the baseline models.
科研通智能强力驱动
Strongly Powered by AbleSci AI