计算机科学
特征(语言学)
语义特征
分割
人工智能
模式识别(心理学)
语义压缩
语义网络
语义学(计算机科学)
自然语言处理
语义计算
语义技术
语义网
程序设计语言
语言学
哲学
作者
Dakai Wang,Wenhao An,Jianxin Ma,Li Wang
标识
DOI:10.1016/j.dsp.2024.104639
摘要
To address the issues of poor segmentation accuracy and insensitivity to details in semantic segmentation, this paper proposes a novel image semantic segmentation framework SEFANet. Specifically, SEFANet adopts encoder-decoder structure, and incorporates a novel perceptual enhancement mechanism called Multi-scale Spatial Integration Module (MSIM) at the encoder. MSIM is based on group convolution to boost spatial semantic features and refine spatial-gradient semantic features within the multi-scale structure. This module enhances feature extraction across different network levels, leading to improved edge detection and segmentation abilities. In the decoder, SEFANet introduces a pixel-level Interleaved Feature Alignment Module (IFAM), which leverages rich semantic information in low-dimensional features and the strategy of Semantic Offset Field. Meanwhile, IFAM warps the high-dimensional feature map into low-dimensional features, completing the calibration process through convolution operations. Experimental results on the Pascal VOC2012 val dataset and the Cityscapes val dataset confirm the effectiveness and generalization of the proposed semantic segmentation. Additionally, the results further demonstrate that SEFANet improves the poor segmentation accuracy and insensitivity to details, and achieves a competitive performance compared with other semantic segmentation methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI