分割
计算机科学
块(置换群论)
瓶颈
棱锥(几何)
残余物
人工智能
特征(语言学)
膨胀(度量空间)
模式识别(心理学)
计算机视觉
算法
数学
嵌入式系统
几何学
哲学
组合数学
语言学
作者
Saquib Mazhar,Nadeem Atif,M. K. Bhuyan,Shaik Rafi Ahamed
出处
期刊:IEEE transactions on artificial intelligence
[Institute of Electrical and Electronics Engineers]
日期:2023-12-12
卷期号:5 (6): 3098-3108
被引量:10
标识
DOI:10.1109/tai.2023.3341976
摘要
Recent advancements in autonomous driving and mobile devices have led to the development of real-time and lightweight semantic image segmentation models. However, these algorithms readily suffer from inherent accuracy loss compared to large networks. DABNet [1] presented a highly efficient method to balance the accuracy-model size tradeoff. Nevertheless, the bottleneck structure and single-scale receptive field of its building block have limited performance for the given network size. To further improve the segmentation score and reduce the number of parameters, the basic block is redesigned using an inverted-residual and dilation pyramid structure (IRDP). The IRDP module can efficiently learn contextual features at multiple dilations within the block. Using the inverted-residual structure with an expansion layer prevents information loss due to the dimensionality reduction of the feature space. The IRDP block is utilized to rebuild the DABNet structure, working in real-time for resource-constrained devices. In addition, a fast and lightweight decoder-FLD is also proposed to improve the segmentation accuracy of the network. Experiments performed on Cityscapes and CamVid datasets demonstrate the effectiveness of the proposed approach. On Cityscapes, IRDPNet can achieve an mIOU of 75.62%. At the same time, the lighter version gets a mIoU of 71.32% with only 0.32 Million parameters, which is similar to the DABNet accuracy with half the number of parameters.
科研通智能强力驱动
Strongly Powered by AbleSci AI