计算机科学
行人
比例(比率)
质量(理念)
地理
运输工程
地图学
工程类
物理
量子力学
作者
Jie Wu,Ying Peng,Chenghao Zheng,Zongbo Hao,Jian Zhang
出处
期刊:Cornell University - arXiv
日期:2019-01-01
被引量:4
标识
DOI:10.48550/arxiv.1912.12799
摘要
Recently, generative adversarial networks (GANs) have shown great advantages in synthesizing images, leading to a boost of explorations of using faked images to augment data. This paper proposes a multimodal cascaded generative adversarial networks (PMC-GANs) to generate realistic and diversified pedestrian images and augment pedestrian detection data. The generator of our model applies a residual U-net structure, with multi-scale residual blocks to encode features, and attention residual blocks to help decode and rebuild pedestrian images. The model constructs in a coarse-to-fine fashion and adopts cascade structure, which is beneficial to produce high-resolution pedestrians. PMC-GANs outperforms baselines, and when used for data augmentation, it improves pedestrian detection results.
科研通智能强力驱动
Strongly Powered by AbleSci AI