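Segmentation
Computer science
Feature extraction
Artificial intelligence
Feature (linguistics)
Pattern recognition (psychology)
Backbone network
Class (philosophy)
Function (biology)
Image segmentation
Convolutional neural network
Object (grammar)
Semantic feature
Semantics (computer science)
Object detection
Image (mathematics)
Computer network
Philosophy
Linguistics
Evolutionary biology
Biology
Programming language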
Authors
Runze Wang,Haoyu Jiang,Yufei Li
Identifier
DOI:10.1109/icetci57876.2023.10177013
Abstract
In this article, we adopt UPerNet as the model and ConvNeXt as the backbone for semantic segmentation on the Cityscapes dataset. UPerNet is a highly compatible architecture that can handle a variety of visual tasks, while ConvNeXt, a convolutional network with no added attention mechanism, has excellent feature extraction ability. We combine UPerNet and ConvNeXt to obtain a semantic segmentation model that uses their feature extraction ability and multi-scale object detection capability to enhance network performance. In addition, we select an appropriate ConvNeXt structure, namely ConvNeXt-small, according to the size and characteristics of the Cityscapes dataset, and optimize the loss function to address the uneven class distribution. The experimental results show that ConvNeXt performs very well in feature extraction: compared with the classic backbone ResNet50, it improves aAcc by 1.96%, mIoU by 12.28%, and mAcc by 11.39%. After the loss function is optimized, all metrics increase as well, which demonstrates that our method is effective.
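The abstract reports three metrics (aAcc, mIoU, mAcc) without defining them. As a minimal sketch, assuming the paper follows the definitions commonly used in semantic segmentation toolkits (overall pixel accuracy, mean per-class intersection over union, mean per-class accuracy), they can be computed from a confusion matrix as shown below. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count pixels: rows are ground-truth classes, columns are predictions."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def segmentation_metrics(cm):
    """Return aAcc, mIoU and mAcc under the assumed standard definitions."""
    n = len(cm)
    total = sum(sum(row) for row in cm)
    correct = sum(cm[i][i] for i in range(n))
    ious, accs = [], []
    for i in range(n):
        tp = cm[i][i]
        gt = sum(cm[i])                          # pixels labelled class i
        pred = sum(cm[r][i] for r in range(n))   # pixels predicted as class i
        if gt == 0:
            continue                             # skip classes absent from the labels
        ious.append(tp / (gt + pred - tp))       # intersection over union
        accs.append(tp / gt)                     # per-class accuracy
    return {
        "aAcc": correct / total,
        "mIoU": sum(ious) / len(ious),
        "mAcc": sum(accs) / len(accs),
    }


# Toy example with flattened pixel labels: class 0 is frequent, class 1 is rare.
y_true = [0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 1, 1, 0]
metrics = segmentation_metrics(confusion_matrix(y_true, y_pred, num_classes=2))
print(metrics)
```

Because aAcc is dominated by frequent classes while mIoU and mAcc weight every class equally, a change that mainly helps rare classes can move mIoU and mAcc much more than aAcc, which is consistent with the pattern of gains reported in the abstract.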