计算机科学
分割
人工智能
特征(语言学)
卷积神经网络
模式识别(心理学)
深度学习
编码(集合论)
数据挖掘
语言学
哲学
集合(抽象数据类型)
程序设计语言
作者
Yuanyuan Zhang,Lin Liu,Ziyi Han,Fan-Yun Meng,Yulin Zhang,Yawu Zhao
标识
DOI:10.1016/j.bspc.2023.105133
摘要
Fully convolutional neural (FCN) networks like U-Net have been the state-of-the-art methods in colorectal polyp segmentation. However, U-Net still has some limitations in modelling remote semantic information, especially since the semantic information at different levels can vary greatly, making it difficult to utilize this information fully. To address these issues, we propose a new network architecture called TranSEFusionNet that utilizes the Transformer's global modelling capability to better focus on global contextual semantic information. In addition, we added two feature fusion modules, a spatial feature fusion module (SFM) and an edge feature fusion module (EFM), to the network. SEF with a skip connection can improve the accuracy of passing deep features to shallow features. EFM in the output part of each decoder layer improves the recognition of edge ambiguous features by refining the semantic information of the network. We validate the model's performance on five publicly available colorectal polyp datasets, and the experiments show that TranSEFusionNet has higher segmentation accuracy. To measure the generalization ability of TranSEFusionNet, we further applied the model to the cell nuclei dataset, which further verifies the performance of our model. Code: https://github.com/Linaaalin/TranSEFusionNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI