A Survey on Visual Mamba

调查研究 心理学 应用心理学
作者
Hanwei Zhang,Ying Zhu,Dingping Wang,Lijun Zhang,Tianxiang Chen,Zi Yan
出处
期刊:Cornell University - arXiv
标识
DOI:10.48550/arxiv.2404.15956
摘要

State space models (SSMs) with selection mechanisms and hardware-aware architectures, namely Mamba, have recently demonstrated significant promise in long-sequence modeling. Since the self-attention mechanism in transformers has quadratic complexity with image size and increasing computational demands, the researchers are now exploring how to adapt Mamba for computer vision tasks. This paper is the first comprehensive survey aiming to provide an in-depth analysis of Mamba models in the field of computer vision. It begins by exploring the foundational concepts contributing to Mamba's success, including the state space model framework, selection mechanisms, and hardware-aware design. Next, we review these vision mamba models by categorizing them into foundational ones and enhancing them with techniques such as convolution, recurrence, and attention to improve their sophistication. We further delve into the widespread applications of Mamba in vision tasks, which include their use as a backbone in various levels of vision processing. This encompasses general visual tasks, Medical visual tasks (e.g., 2D / 3D segmentation, classification, and image registration, etc.), and Remote Sensing visual tasks. We specially introduce general visual tasks from two levels: High/Mid-level vision (e.g., Object detection, Segmentation, Video classification, etc.) and Low-level vision (e.g., Image super-resolution, Image restoration, Visual generation, etc.). We hope this endeavor will spark additional interest within the community to address current challenges and further apply Mamba models in computer vision.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
粗心的羽毛应助Gleast采纳,获得10
1秒前
潦草小狗发布了新的文献求助10
3秒前
彭于晏应助三驾马车采纳,获得10
3秒前
3秒前
科研狗发布了新的文献求助10
3秒前
Yezi完成签到,获得积分20
5秒前
科研通AI6.2应助Leslie采纳,获得10
5秒前
小白发布了新的文献求助10
6秒前
123发布了新的文献求助10
6秒前
jaja发布了新的文献求助10
6秒前
7秒前
完美世界应助kklove采纳,获得10
7秒前
7秒前
8秒前
Hello应助511采纳,获得10
9秒前
而当下的发布了新的文献求助10
9秒前
9秒前
烟花应助星期采纳,获得10
9秒前
爆米花应助ljr采纳,获得10
9秒前
碧赴应助惊鸿客采纳,获得100
10秒前
Han关闭了Han文献求助
10秒前
10秒前
隐形曼青应助刘的花采纳,获得10
10秒前
11秒前
12秒前
Lebesgue完成签到 ,获得积分10
12秒前
13秒前
bkagyin应助huna0004采纳,获得10
13秒前
若珊完成签到,获得积分20
13秒前
碧赴发布了新的文献求助10
13秒前
14秒前
ddd发布了新的文献求助10
14秒前
Camellia发布了新的文献求助10
14秒前
Sammos应助自由寻冬采纳,获得10
15秒前
orixero应助Chaimengdi采纳,获得10
15秒前
15秒前
充电宝应助VANGOGH采纳,获得10
15秒前
丁一发布了新的文献求助10
15秒前
lsl应助狗熊采纳,获得10
15秒前
高分求助中
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Materials selection in mechanical design 500
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Ideology and Meaning-Making under the Putin Regime 450
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6479617
求助须知:如何正确求助?哪些是违规求助? 8280673
关于积分的说明 17662047
捐赠科研通 5562338
什么是DOI,文献DOI怎么找? 2911427
邀请新用户注册赠送积分活动 1888509
关于科研通互助平台的介绍 1742681