计算机科学
分割
失败
编码器
光学(聚焦)
人工智能
像素
人工神经网络
图像分割
建筑
国家(计算机科学)
网络体系结构
计算机视觉
模式识别(心理学)
并行计算
算法
视觉艺术
艺术
物理
光学
操作系统
计算机安全
作者
Abhishek Chaurasia,Eugenio Culurciello
标识
DOI:10.1109/vcip.2017.8305148
摘要
Pixel-wise semantic segmentation for visual scene understanding not only needs to be accurate, but also efficient in order to find any use in real-time application. Existing algorithms even though are accurate but they do not focus on utilizing the parameters of neural network efficiently. As a result they are huge in terms of parameters and number of operations; hence slow too. In this paper, we propose a novel deep neural network architecture which allows it to learn without any significant increase in number of parameters. Our network uses only 11.5 million parameters and 21.2 GFLOPs for processing an image of resolution 3 × 640 × 360. It gives state-of-the-art performance on CamVid and comparable results on Cityscapes dataset. We also compare our networks processing time on NVIDIA GPU and embedded system device with existing state-of-the-art architectures for different image resolutions.
科研通智能强力驱动
Strongly Powered by AbleSci AI