Image-Level Automatic Data Augmentation for Pedestrian Detection

计算机科学范畴变量离群值数据挖掘图像（数学）人工智能行人检测集合（抽象数据类型）异常检测数据清理行人模式识别（心理学）机器学习公制（单位）工程类运营管理数据质量运输工程程序设计语言

作者

Yunfeng Ma,Min Liu,Yi Tang,Xueping Wang,Yaonan Wang

出处

期刊：IEEE Transactions on Instrumentation and Measurement [Institute of Electrical and Electronics Engineers]
日期：2024-01-01 卷期号：73: 1-12

标识

DOI：10.1109/tim.2023.3336760

摘要

Data augmentation (DA) is a commonly used method to alleviate the problem of detecting occluded pedestrians in crowded scenes. Recently, several dataset-level automatic data augmentation methods have been proposed to search for a set of general DA policies for the entire dataset, which saves a lot of time compared to manually designing DA policies. However, due to the huge differences between each image in pedestrian detection datasets, existing dataset-level augmentation methods cannot automatically adjust DA policies according to the differences between them, which will lead to outlier data and degradation of model performance. Therefore, considering the differences between each image, we propose an image-level automatic data augmentation method that aims to find an optimal DA policy for each image in the dataset according to their respective characteristics. Specifically, we first reformulate the image-level automatic data augmentation method by constructing a search space based on Categorical distribution, within which we specify the probability of operations being sampled according to their respective effectiveness, so that useful operations can be effectively preserved and useless operations can be suppressed. Subsequently, we design an encoding method to recode the index of images and policies, and use the encoded index to closely associate them to achieve a stable matching relationship between images and policies. Finally, a search framework with Bayesian optimization is developed for efficient policy mining. Comprehensive experiments on CrowdHuman and CityPersons datasets show that compared with the commonly used automatic data augmentation method for pedestrian detection, AutoPedestrian, our method takes only 1/14 of the search time, but achieves better detection accuracy. Specifically, we achieve 10.2% MR ^-2 on CityPersons reasonable subset and 36.8% MR ^-2 on CrowdHuman dataset, outperforming state-of-the-art methods on CrowdHuman dataset.

求助该文献

最长约 10秒，即可获得该文献文件

Image-Level Automatic Data Augmentation for Pedestrian Detection

今日热心研友