特征(语言学)
计算机科学
职位(财务)
人工智能
姿势
插值(计算机图形学)
卷积(计算机科学)
模式识别(心理学)
特征学习
特征提取
计算机视觉
算法
人工神经网络
图像(数学)
财务
哲学
语言学
经济
作者
Chen Wang,Yanghong Zhou,Feng Zhang,P.Y. Mok
出处
期刊:Neurocomputing
[Elsevier BV]
日期:2023-03-31
卷期号:537: 152-163
被引量:3
标识
DOI:10.1016/j.neucom.2023.03.063
摘要
Multi-scale feature fusion is a commonly-used module in existing deep-learning models, and feature misalignment occurs in the process of feature fusion. The spatial misalignment hinders the learning of semantic representation with multi-scale levels, but which has not received much attention. This misalignment problem is caused by the feature position shift after using the convolution and interpolation operation in feature fusion. To solve the misalignment problem, this paper formulates the shift error mathematically and proposes a plug-and-play unbiased feature position alignment strategy to align convolution with interpolation. As a model-agnostic approach, unbiased feature position alignment can boost the performance of different models without introducing extra parameters. Furthermore, the unbiased feature position alignment is applied to build an unbiased human pose estimation method. Experimental results have demonstrated the effectiveness of the proposed unbiased pose model in comparison to the state-of-the-arts, especially in the low-resolution field. The codes are shared at https://github.com/WangChen100/Unbiased-Feature-Position-Alignment-for-Human-Pose-Estimation.
科研通智能强力驱动
Strongly Powered by AbleSci AI