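Computer science
Robustness (evolution)
Training set
Artificial intelligence
Noisy data
Noise (video)
Annotation
Obstacle
Machine learning
Pattern recognition (psychology)
Data mining
Image (mathematics)
Gene
Biochemistry
Chemistry
Law
Political science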
Authors
Chuanyi Zhang,Yazhou Yao,Xing Xu,Jie Shao,Jingkuan Song,Zechao Li,Zhenmin Tang
Identifier
DOI:10.1145/3474085.3475536
Abstract
Fine-grained visual recognition tasks typically require training data with reliable acquisition and annotation processes. Acquiring such datasets with precise fine-grained annotations is very expensive and time-consuming. Conversely, a vast amount of web data is relatively easy to obtain with nearly no human effort. Nevertheless, the presence of label noise in web images becomes a huge obstacle for training robust fine-grained recognition models. In this work, we investigate the noisy label problem and propose a method that can specifically distinguish in- and out-of-distribution noisy samples. It can purify the web training data by discarding out-of-distribution noisy images and relabeling in-distribution ones. After purification, we can train the model on a less noisy web training set to achieve better robustness and performance. Extensive experiments on three real-world web datasets for fine-grained visual recognition demonstrate the superiority of our approach.
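The purification step described in the abstract (keep clean samples, relabel in-distribution noisy ones, discard out-of-distribution ones) could be sketched roughly as below. The abstract does not say how the two kinds of noise are told apart, so the rule used here is purely an assumption for illustration: a sample whose predicted class disagrees with its web label is treated as in-distribution noise when the model is confident, and as out-of-distribution noise otherwise. The function name `purify`, the `relabel_threshold` parameter, and the use of softmax-style class probabilities are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def purify(
    probs: Sequence[Sequence[float]],
    labels: Sequence[int],
    relabel_threshold: float = 0.8,  # hypothetical cutoff, not from the paper
) -> Tuple[List[int], List[int]]:
    """Return (kept_indices, new_labels) for a noisy web training set.

    probs[i]  : assumed per-class probabilities predicted for sample i
    labels[i] : the (possibly wrong) web label of sample i
    """
    kept_indices: List[int] = []
    new_labels: List[int] = []
    for i, (p, y) in enumerate(zip(probs, labels)):
        # Index of the most probable class for this sample.
        pred = max(range(len(p)), key=p.__getitem__)
        if pred == y:
            # Prediction agrees with the web label: treat as clean.
            kept_indices.append(i)
            new_labels.append(y)
        elif p[pred] >= relabel_threshold:
            # Confident disagreement: assumed in-distribution noise, relabel.
            kept_indices.append(i)
            new_labels.append(pred)
        # Otherwise: assumed out-of-distribution noise, discard the sample.
    return kept_indices, new_labels


if __name__ == "__main__":
    probs = [
        [0.90, 0.05, 0.05],  # agrees with label 0
        [0.10, 0.85, 0.05],  # confidently class 1, but labeled 0
        [0.40, 0.30, 0.30],  # uncertain and disagrees with label 1
    ]
    labels = [0, 0, 1]
    print(purify(probs, labels))
```

In this toy run the first sample is kept as is, the second is relabeled to class 1, and the third is dropped. A model would then be retrained on the kept indices with the new labels, which matches the "train on a less noisy web training set" step only at this high level.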