众包
噪音(视频)
多数决原则
计算机科学
基本事实
人工智能
公制(单位)
推论
集合(抽象数据类型)
滤波器(信号处理)
投票
机器学习
噪声测量
特征(语言学)
模式识别(心理学)
数据挖掘
算法
降噪
计算机视觉
图像(数学)
程序设计语言
法学
万维网
经济
哲学
政治
语言学
运营管理
政治学
作者
Huiru Li,Liangxiao Jiang,Xue Song
出处
期刊:ACM Transactions on Knowledge Discovery From Data
[Association for Computing Machinery]
日期:2023-04-14
卷期号:17 (7): 1-18
被引量:3
摘要
In crowdsourcing scenarios, we can obtain each instance’s multiple noisy labels set from different crowd workers and then use a ground truth inference algorithm to infer its integrated label. Despite the effectiveness of ground truth inference algorithms, a certain level of noise still remains in the integrated labels. To reduce the impact of noise, many noise correction algorithms have been proposed in recent years. To the best of our knowledge, however, nearly all existing noise correction algorithms only exploit each instance’s own multiple noisy label sets but ignore the multiple noisy label sets of its neighbors. Here neighbors refer to the nearest instances found in the feature space based on the distance metric learning. In this article, we propose neighborhood weighted voting-based noise correction (NWVNC). In NWVNC, we at first take advantage of the multiple noisy label sets of each instance’s neighbors (including itself) to estimate the probability that it belongs to its integrated label. Then, we use the estimated probability to identify and filter noise instances and thus obtain a clean set and a noise set. Finally, we train three heterogeneous classifiers on the clean set and correct the noise instances by the consensus voting of three trained classifiers. The experimental results on 34 simulated and two real-world crowdsourced datasets show that NWVNC significantly outperforms all the other state-of-the-art noise correction algorithms used for comparison.
科研通智能强力驱动
Strongly Powered by AbleSci AI