计算机科学
新颖性
机器学习
人工智能
公制(单位)
主动学习(机器学习)
标记数据
训练集
性能指标
半监督学习
工程类
哲学
运营管理
神学
管理
经济
作者
J Li,Pengguang Chen,Shaozuo Yu,Shu Liu,Jiaya Jia
标识
DOI:10.1109/tpami.2023.3345844
摘要
The objective of Active Learning is to strategically label a subset of the dataset to maximize performance within a predetermined labeling budget. In this study, we harness features acquired through self-supervised learning. We introduce a straightforward yet potent metric, Cluster Distance Difference, to identify diverse data. Subsequently, we introduce a novel framework, Balancing Active Learning (BAL), which constructs adaptive sub-pools to balance diverse and uncertain data. Our approach outperforms all established active learning methods on widely recognized benchmarks by 1.20%. Moreover, we assess the efficacy of our proposed framework under extended settings, encompassing both larger and smaller labeling budgets. Experimental results demonstrate that, when labeling 80% of the samples, the performance of the current SOTA method declines by 0.74%, whereas our proposed BAL achieves performance comparable to the full dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI