计算机科学
姿势
特征(语言学)
频道(广播)
卷积(计算机科学)
点式的
人工智能
编码(集合论)
数据聚合器
可分离空间
比例(比率)
模式识别(心理学)
数据挖掘
数学
人工神经网络
无线传感器网络
地图学
计算机网络
数学分析
哲学
语言学
集合(抽象数据类型)
程序设计语言
地理
作者
Beitao Chen,Xuanhan Wang,Xiaojia Chen,Yulan He,Jingkuan Song
标识
DOI:10.1109/icme55011.2023.00449
摘要
Existing solutions to lightweight human pose estimation typically adopt a depthwise separable strategy, i.e., a normal 2D convolution is factorized into channel aggregation and spatial aggregation. However, this strategy cannot well capture multi-scale Effective Receptive Field (ERF), which is essential to dense prediction tasks like human pose estimation. To address this issue, we propose a novel lightweight network for human pose estimation, namely effective aggregation net (EANet). In EANet, we introduce two lightweight computational units: effective channel aggregating (ECA) and effective spatial aggregating (ESA), which are respectively responsible for channel-wise feature aggregation and pixel-wise feature aggregation. Unlike typical channel-wise aggregation using pointwise (1 × 1) convolution, the ECA aggregates few feature points that are estimated as effective ones. Moreover, the ESA is designed with re-parameterizing techniques, and it aggregates effective spatial feature points with multi-scale shared convolutions. Comprehensive experiments are conducted on three challenging datasets, i.e., COCO, Crowd-Pose, Wholebody-COCO. Our EANet demonstrates superior results on human pose estimation over previous lightweight methods, reaching a new state-of-the-art performance with a good trade-off. Our code and models are publicly available1.
科研通智能强力驱动
Strongly Powered by AbleSci AI