SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

计算机科学 人工智能 特征(语言学) 编码器 增采样 卷积神经网络 联营 像素 水准点(测量) 分割 模式识别(心理学) 网络体系结构 图像分割 深度学习 计算机视觉 图像(数学) 哲学 操作系统 语言学 计算机安全 地理 大地测量学
作者
Vijay Badrinarayanan,Roberto Cipolla
出处
期刊:IEEE Transactions on Pattern Analysis and Machine Intelligence [Institute of Electrical and Electronics Engineers]
卷期号:39 (12): 2481-2495 被引量:12287
标识
DOI:10.1109/tpami.2016.2644615
摘要

We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network, a corresponding decoder network followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG16 network [1] . The role of the decoder network is to map the low resolution encoder feature maps to full input resolution feature maps for pixel-wise classification. The novelty of SegNet lies is in the manner in which the decoder upsamples its lower resolution input feature map(s). Specifically, the decoder uses pooling indices computed in the max-pooling step of the corresponding encoder to perform non-linear upsampling. This eliminates the need for learning to upsample. The upsampled maps are sparse and are then convolved with trainable filters to produce dense feature maps. We compare our proposed architecture with the widely adopted FCN [2] and also with the well known DeepLab-LargeFOV [3] , DeconvNet [4] architectures. This comparison reveals the memory versus accuracy trade-off involved in achieving good segmentation performance. SegNet was primarily motivated by scene understanding applications. Hence, it is designed to be efficient both in terms of memory and computational time during inference. It is also significantly smaller in the number of trainable parameters than other competing architectures and can be trained end-to-end using stochastic gradient descent. We also performed a controlled benchmark of SegNet and other architectures on both road scenes and SUN RGB-D indoor scene segmentation tasks. These quantitative assessments show that SegNet provides good performance with competitive inference time and most efficient inference memory-wise as compared to other architectures. We also provide a Caffe implementation of SegNet and a web demo at http://mi.eng.cam.ac.uk/projects/segnet/.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
滴滴哩哩完成签到,获得积分10
2秒前
aaa发布了新的文献求助30
3秒前
bkagyin应助千里江山一只蝇采纳,获得10
4秒前
杨然发布了新的文献求助10
6秒前
白啊啊啊啊啊完成签到 ,获得积分10
7秒前
8秒前
大模型应助Betty采纳,获得10
10秒前
14秒前
研友_VZG7GZ应助aaa采纳,获得10
17秒前
可爱的函函应助浮云采纳,获得10
18秒前
。。。发布了新的文献求助10
19秒前
仁爱芷波完成签到,获得积分10
19秒前
TF邓佳鑫应助余钱半两采纳,获得10
21秒前
赘婿应助科研通管家采纳,获得10
21秒前
科研通AI2S应助科研通管家采纳,获得10
21秒前
Jasper应助科研通管家采纳,获得10
21秒前
秋雪瑶应助科研通管家采纳,获得10
21秒前
科研通AI2S应助科研通管家采纳,获得10
22秒前
orixero应助科研通管家采纳,获得10
22秒前
Hello应助科研通管家采纳,获得10
22秒前
22秒前
22秒前
22秒前
Lucas应助科研通管家采纳,获得10
22秒前
23秒前
斯文败类应助舒适的太君采纳,获得10
23秒前
24秒前
24秒前
25秒前
26秒前
volvoamg发布了新的文献求助10
27秒前
27秒前
27秒前
无妍完成签到,获得积分10
29秒前
浮云发布了新的文献求助10
30秒前
鲁西西发布了新的文献求助10
30秒前
Li发布了新的文献求助10
31秒前
yc发布了新的文献求助10
31秒前
yulian完成签到,获得积分10
32秒前
33秒前
高分求助中
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Illustrated History of Gymnastics 800
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Herman Melville: A Biography (Volume 1, 1819-1851) 600
Division and square root. Digit-recurrence algorithms and implementations 500
Hemerologies of Assyrian and Babylonian Scholars 500
Manual of Clinical Microbiology, 13th Edition 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2499168
求助须知:如何正确求助?哪些是违规求助? 2154592
关于积分的说明 5510984
捐赠科研通 1875415
什么是DOI,文献DOI怎么找? 932731
版权声明 563762
科研通“疑难数据库(出版商)”最低求助积分说明 498432