计算机科学
卷积神经网络
人工智能
杠杆(统计)
频道(广播)
比例(比率)
棱锥(几何)
模式识别(心理学)
空间分析
深度学习
联营
机器学习
计算机网络
数学
统计
物理
几何学
量子力学
作者
Xin Zeng,Shizhe Hu,Qifeng Guo,Yunpeng Wu,Yangdong Ye
标识
DOI:10.1145/3579654.3579655
摘要
Crowd counting has been a fundamental yet challenging problem in pattern recognition. Most recent deep models for crowd counting rely on Convolutional Neural Networks (CNNs). Although CNN visual features comprise the spatial and channel features, existing deep models on crowd counting have limited descriptive ability as they only focus on the spatial or channel information. In this paper, we propose Scale Enhanced Network with Dual Attention Booster named as SEN-DAB, a novel method to jointly learn the representations of spatial and channel information for crowd counting. Moreover, to further leverage the multi-scale information, a pyramid residual scale enhanced block is presented to process the multi-scale features. As a result, the learned spatial, channel and multi-scale features can be robust to appearance changes of the crowd. Our model is tested on three benchmarks and the experimental results confirm that the promising performance of SEN-DAB when compared with various networks.
科研通智能强力驱动
Strongly Powered by AbleSci AI