计算机科学
范畴变量
离群值
数据挖掘
图像(数学)
人工智能
行人检测
集合(抽象数据类型)
异常检测
数据清理
行人
模式识别(心理学)
机器学习
公制(单位)
工程类
运营管理
数据质量
运输工程
程序设计语言
作者
Yunfeng Ma,Min Liu,Yi Tang,Xueping Wang,Yaonan Wang
出处
期刊:IEEE Transactions on Instrumentation and Measurement
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:73: 1-12
标识
DOI:10.1109/tim.2023.3336760
摘要
Data augmentation (DA) is a commonly used method to alleviate the problem of detecting occluded pedestrians in crowded scenes. Recently, several dataset-level automatic data augmentation methods have been proposed to search for a set of general DA policies for the entire dataset, which saves a lot of time compared to manually designing DA policies. However, due to the huge differences between each image in pedestrian detection datasets, existing dataset-level augmentation methods cannot automatically adjust DA policies according to the differences between them, which will lead to outlier data and degradation of model performance. Therefore, considering the differences between each image, we propose an image-level automatic data augmentation method that aims to find an optimal DA policy for each image in the dataset according to their respective characteristics. Specifically, we first reformulate the image-level automatic data augmentation method by constructing a search space based on Categorical distribution, within which we specify the probability of operations being sampled according to their respective effectiveness, so that useful operations can be effectively preserved and useless operations can be suppressed. Subsequently, we design an encoding method to recode the index of images and policies, and use the encoded index to closely associate them to achieve a stable matching relationship between images and policies. Finally, a search framework with Bayesian optimization is developed for efficient policy mining. Comprehensive experiments on CrowdHuman and CityPersons datasets show that compared with the commonly used automatic data augmentation method for pedestrian detection, AutoPedestrian, our method takes only 1/14 of the search time, but achieves better detection accuracy. Specifically, we achieve 10.2% MR -2 on CityPersons reasonable subset and 36.8% MR -2 on CrowdHuman dataset, outperforming state-of-the-art methods on CrowdHuman dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI