A serial semantic segmentation model based on encoder-decoder architecture

计算机科学 分割 卷积神经网络 人工智能 编码器 模式识别(心理学) 杠杆(统计) 可扩展性 计算机视觉 计算机工程 数据库 操作系统
作者
Yan Zhou
出处
期刊:Knowledge Based Systems [Elsevier]
卷期号:: 111819-111819
标识
DOI:10.1016/j.knosys.2024.111819
摘要

The thriving progress of Convolutional Neural Networks (CNNs) and the outstanding efficacy of Visual Transformers (ViTs) have delivered impressive outcomes in the domain of semantic segmentation. However, each model in isolation entails a trade-off between high computational complexity and compromised computational efficiency. To address this challenge, we effectively combine the CNN and encoder-decoder structures in a Transformer-inspired fashion, presenting the Serial Semantic Segmentation Trans via CNN Former (SSS-Former) model. To augment the feature extraction capability, we utilize the meticulously crafted SSS-CSPNet, resulting in a well-designed architecture for the holistic model. We propose a novel SSS-PN attention network that enhances the spatial topological connections of features, leading to improved overall performance. Additionally, the integration of SASPP bridges the semantic gap between multi-scale features and enhances segmentation ability for overlapping objects. To fulfill the requirement of real-time segmentation, we leverage a novel restructuring technique to devise a more lightweight and faster ResSSS-Former model. Abundant experimental results demonstrate that both SSS-Former and ResSSS-Former outperform existing state-of-the-art methods in terms of computational efficiency, result precision, and speed. Remarkably, SSS-Former achieves a mIoU of 58.63% at 89.1 FPS on the ADE20K dataset. On the validation and testing datasets of CityScapes, it obtains mIoU scores of 85.1% and 85.2% respectively, with a speed of 94.1 FPS. Our optimized ResSSS-Former achieves impressive real-time segmentation results, with an astonishing 100+ FPS while maintaining high segmentation accuracy. The compelling results from the ISPRS datasets further validate the effectiveness of our proposed models in segmenting multi-scale and overlapping objects.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
林伟发布了新的文献求助10
3秒前
超级明雪完成签到,获得积分10
4秒前
赵三木发布了新的文献求助10
5秒前
xiehui完成签到,获得积分10
5秒前
Hammerdai完成签到,获得积分10
5秒前
ANNE发布了新的文献求助10
6秒前
脑洞疼应助cccc1111111采纳,获得10
6秒前
kaworul发布了新的文献求助30
7秒前
11秒前
ANNE完成签到,获得积分10
13秒前
14秒前
爆米花应助科研通管家采纳,获得10
14秒前
脑洞疼应助科研通管家采纳,获得10
14秒前
科目三应助科研通管家采纳,获得10
14秒前
14秒前
kaworul完成签到,获得积分10
14秒前
bubu完成签到,获得积分10
15秒前
bob的美腿发布了新的文献求助10
17秒前
今后应助kaworul采纳,获得10
18秒前
18秒前
落叶完成签到 ,获得积分10
20秒前
领导范儿应助戴先森采纳,获得10
20秒前
桐桐应助bob的美腿采纳,获得10
24秒前
24秒前
25秒前
静谧180完成签到 ,获得积分10
26秒前
30秒前
tingalan发布了新的文献求助10
30秒前
SOLOMON应助fhhkckk3采纳,获得10
32秒前
cuijiawen发布了新的文献求助10
32秒前
Qixiner应助沉淀采纳,获得10
35秒前
犹豫的夏旋完成签到 ,获得积分10
36秒前
40秒前
gjww应助李剑鸿采纳,获得800
40秒前
42秒前
121314wld完成签到,获得积分10
43秒前
ixueyi完成签到,获得积分10
46秒前
弈心发布了新的文献求助10
48秒前
Hello应助清澜庭采纳,获得10
54秒前
高分求助中
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
Yuwu Song, Biographical Dictionary of the People's Republic of China 700
[Lambert-Eaton syndrome without calcium channel autoantibodies] 520
Sphäroguß als Werkstoff für Behälter zur Beförderung, Zwischen- und Endlagerung radioaktiver Stoffe - Untersuchung zu alternativen Eignungsnachweisen: Zusammenfassender Abschlußbericht 500
少脉山油柑叶的化学成分研究 430
Lung resection for non-small cell lung cancer after prophylactic coronary angioplasty and stenting: short- and long-term results 400
Revolutions 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2452868
求助须知:如何正确求助?哪些是违规求助? 2125087
关于积分的说明 5410727
捐赠科研通 1853993
什么是DOI,文献DOI怎么找? 922092
版权声明 562297
科研通“疑难数据库(出版商)”最低求助积分说明 493309