A Survey on Visual Mamba

调查研究 心理学 应用心理学
作者
Hanwei Zhang,Ying Zhu,Dingping Wang,Lijun Zhang,Tianxiang Chen,Zi Yan
出处
期刊:Cornell University - arXiv
标识
DOI:10.48550/arxiv.2404.15956
摘要

State space models (SSMs) with selection mechanisms and hardware-aware architectures, namely Mamba, have recently demonstrated significant promise in long-sequence modeling. Since the self-attention mechanism in transformers has quadratic complexity with image size and increasing computational demands, the researchers are now exploring how to adapt Mamba for computer vision tasks. This paper is the first comprehensive survey aiming to provide an in-depth analysis of Mamba models in the field of computer vision. It begins by exploring the foundational concepts contributing to Mamba's success, including the state space model framework, selection mechanisms, and hardware-aware design. Next, we review these vision mamba models by categorizing them into foundational ones and enhancing them with techniques such as convolution, recurrence, and attention to improve their sophistication. We further delve into the widespread applications of Mamba in vision tasks, which include their use as a backbone in various levels of vision processing. This encompasses general visual tasks, Medical visual tasks (e.g., 2D / 3D segmentation, classification, and image registration, etc.), and Remote Sensing visual tasks. We specially introduce general visual tasks from two levels: High/Mid-level vision (e.g., Object detection, Segmentation, Video classification, etc.) and Low-level vision (e.g., Image super-resolution, Image restoration, Visual generation, etc.). We hope this endeavor will spark additional interest within the community to address current challenges and further apply Mamba models in computer vision.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
霸气早晨完成签到,获得积分10
2秒前
hhhh应助跳跃初露采纳,获得10
2秒前
3秒前
LJ徽完成签到 ,获得积分10
3秒前
冷酷思远完成签到 ,获得积分10
6秒前
一点就通完成签到,获得积分10
7秒前
8秒前
华毒娘发布了新的文献求助10
9秒前
10秒前
段鹏鹏发布了新的文献求助10
11秒前
DaLu完成签到,获得积分10
12秒前
17秒前
22发布了新的文献求助10
17秒前
18秒前
18秒前
19秒前
20秒前
小棉背心完成签到 ,获得积分10
20秒前
冉然完成签到,获得积分10
21秒前
hhhh应助22采纳,获得30
22秒前
22秒前
麻祖完成签到 ,获得积分10
22秒前
一轮太阳和幻想完成签到,获得积分10
23秒前
健忘半邪发布了新的文献求助10
23秒前
冉然发布了新的文献求助10
24秒前
桐桐应助Arkhamk采纳,获得10
24秒前
完美世界应助二十八画生采纳,获得10
25秒前
28秒前
29秒前
Pretrial完成签到 ,获得积分10
29秒前
30秒前
Ava应助科研通管家采纳,获得10
30秒前
隐形曼青应助科研通管家采纳,获得10
30秒前
深情安青应助科研通管家采纳,获得10
30秒前
30秒前
30秒前
sky123应助科研通管家采纳,获得10
30秒前
懵懂的曼寒完成签到,获得积分10
30秒前
30秒前
乐乐应助自由的灯泡采纳,获得10
30秒前
高分求助中
请在求助之前详细阅读求助说明!!!! 20000
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
A radiographic standard of reference for the growing knee 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2476891
求助须知:如何正确求助?哪些是违规求助? 2140774
关于积分的说明 5456553
捐赠科研通 1864131
什么是DOI,文献DOI怎么找? 926706
版权声明 562846
科研通“疑难数据库(出版商)”最低求助积分说明 495833