增采样
计算机科学
联营
人工智能
卷积(计算机科学)
棱锥(几何)
卷积神经网络
背景(考古学)
密度估算
透视图(图形)
模式识别(心理学)
像素
图像(数学)
人工神经网络
计算机视觉
数学
地理
统计
考古
估计员
几何学
作者
Liping Zhu,Chengyang Li,Bing Wang,Kun Yuan,Zhongguo Yang
出处
期刊:Neurocomputing
[Elsevier BV]
日期:2019-11-02
卷期号:378: 455-466
被引量:10
标识
DOI:10.1016/j.neucom.2019.10.081
摘要
Due to non-uniform density and variations in scale and perspective, estimating crowd count in crowded scenes in different degree is an extremely challenging task. The deep learning models mostly use pooling operation so that the density map of original resolution is obtained through the last upsampling. This paper aims to solve the problem of losing local spatial information by pooling in density map estimation. Therefore, we propose a dilated convolution neural network with global self-attention, named DCGSA. Especially, we introduce a Global Self-Attention module (GSA) to provide global context as guidance of low-level features to select person location details and a Pyramid Dilated Convolution module (PDC) that extracts channel-wise and pixel-wise features more precisely. Extensive experiments on several crowd datasets show that our method achieves lower crowd counting error and better density maps compared to the recent state-of-the-art methods. In particular, our method also performs well on the sparse dataset UCSD.
科研通智能强力驱动
Strongly Powered by AbleSci AI