MAXFormer: Enhanced transformer for medical image segmentation with multi-attention and multi-scale features fusion

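Computer science; Upsampling; Convolutional neural network; Segmentation; Artificial intelligence; Encoder; Locality; Image segmentation; FLOPS; Pattern recognition (psychology); Image (mathematics); Parallel computing; Linguistics; Operating system; Philosophy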
Authors
Zhiwei Liang,Kui Zhao,Gang Liang,Siyu Li,Yifei Wu,Yiping Zhou
Source
Journal: Knowledge-Based Systems [Elsevier]
Volume/Issue: 280: 110987-110987 Citations: 34
Identifier
DOI:10.1016/j.knosys.2023.110987
Abstract

Convolutional neural networks (CNNs), especially U-shaped networks, have become the mainstream approach for medical image segmentation. However, due to the intrinsic locality of convolutional operations, CNNs have inherent limitations in capturing long-range dependencies. Although Transformer-based methods have demonstrated remarkable performance in computer vision by modeling long-range dependencies, their high computational complexity and reliance on large-scale pre-training present challenges, particularly for higher-resolution medical images. In this paper, we introduce MAXFormer, a U-shaped hierarchical network that effectively leverages global context within individual samples and relationships between different samples. Our Transformer module reformulates the self-attention mechanism into two parts: local–global attention and external attention. The local–global attention provides an efficient alternative to self-attention with linear complexity, employing a parallel architecture that allows local–global spatial interactions. The local attention branch captures high-frequency local information, while the global attention branch captures low-frequency global information. Furthermore, we have designed the Refined Fused Connection module to effectively merge feature outputs from each encoder block with the decoder output, mitigating spatial detail loss due to downsampling. Extensive experiments on two different medical image segmentation datasets show that our proposed method outperforms other state-of-the-art methods without requiring pre-training weights. Code will be available at https://github.com/zhiwei-liang/MAXFormer.
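
The abstract names external attention as the component that relates different samples but gives no formulation. As a rough illustration only, the sketch below follows the way external attention is commonly described in the literature: tokens attend to two small learnable memories shared across all samples, with a double normalization of the attention map. It is not taken from the MAXFormer code; the function name `external_attention`, the memory size `s`, and all shapes are assumptions chosen for the example.

```python
import numpy as np

def external_attention(x, m_k, m_v, eps=1e-9):
    """Generic external-attention sketch (assumed form, not the MAXFormer code).

    x   : (n, d) token features of one sample
    m_k : (s, d) key memory shared across all samples (hypothetical size s)
    m_v : (s, d) value memory shared across all samples
    """
    attn = x @ m_k.T                                  # (n, s) token-to-memory affinities
    attn = attn - attn.max(axis=0, keepdims=True)     # stabilise the exponentials
    attn = np.exp(attn)
    attn = attn / attn.sum(axis=0, keepdims=True)     # softmax over the token axis
    attn = attn / (attn.sum(axis=1, keepdims=True) + eps)  # l1-normalise over memory slots
    return attn @ m_v                                 # (n, d) updated token features

rng = np.random.default_rng(0)
x = rng.standard_normal((16, 8))     # 16 tokens, 8 channels
m_k = rng.standard_normal((4, 8))    # 4 memory slots
m_v = rng.standard_normal((4, 8))
out = external_attention(x, m_k, m_v)
```

Because the memories have a fixed size `s`, the cost of this sketch grows linearly with the number of tokens `n`, and because the same memories are applied to every input, they can in principle encode statistics shared across samples.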
