USING INSTANCE CLONING TO IMPROVE NAIVE BAYES FOR RANKING

朴素贝叶斯分类器 机器学习 人工智能 排名(信息检索) 计算机科学 贝叶斯定理 Bayes错误率 数据挖掘 贝叶斯概率 支持向量机 贝叶斯分类器
作者
Liangxiao Jiang,Dianhong Wang,Harry Zhang,Zhihua Cai,Bo Huang
出处
期刊:International Journal of Pattern Recognition and Artificial Intelligence [World Scientific]
卷期号:22 (06): 1121-1140 被引量:16
标识
DOI:10.1142/s0218001408006703
摘要

Improving naive Bayes (simply NB) 15,28 for classification has received significant attention. Related work can be broadly divided into two approaches: eager learning and lazy learning. 1 Different from eager learning, the key idea for extending naive Bayes using lazy learning is to learn an improved naive Bayes for each test instance. In recent years, several lazy extensions of naive Bayes have been proposed. For example, LBR, 30 SNNB, 27 and LWNB. 8 All these algorithms aim to improve naive Bayes' classification performance. Indeed, they achieve significant improvement in terms of classification, measured by accuracy. In many real-world data mining applications, however, an accurate ranking is more desirable than an accurate classification. Thus a natural question is whether they also achieve significant improvement in terms of ranking, measured by AUC (the area under the ROC curve). 2,11,17 Responding to this question, we conduct experiments on the 36 UCI data sets 18 selected by Weka 12 to investigate their ranking performance and find that they do not significantly improve the ranking performance of naive Bayes. Aiming at scaling up naive Bayes' ranking performance, we present a novel lazy method ICNB (instance cloned naive Bayes) and develop three ICNB algorithms using different instance cloning strategies. We empirically compare them with naive Bayes. The experimental results show that our algorithms achieve significant improvement in terms of AUC. Our research provides a simple but effective method for the applications where an accurate ranking is desirable.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
破防的陈ber完成签到,获得积分10
1秒前
欢喜的祥发布了新的文献求助10
2秒前
crisprin完成签到,获得积分10
3秒前
4秒前
dde应助shinn采纳,获得10
5秒前
5秒前
古涛关注了科研通微信公众号
6秒前
科研通AI6.2应助芳菲采纳,获得10
7秒前
8秒前
可爱的函函应助叶远望采纳,获得10
8秒前
细斟北斗发布了新的文献求助10
9秒前
HQQ应助yier采纳,获得10
10秒前
金城中月完成签到,获得积分10
10秒前
11秒前
共享精神应助zz采纳,获得10
11秒前
健壮惋清发布了新的文献求助10
11秒前
打打应助JMrider采纳,获得10
11秒前
12秒前
13秒前
张安安完成签到,获得积分10
13秒前
14秒前
14秒前
15秒前
闪闪新梅完成签到,获得积分10
16秒前
17秒前
18秒前
张安安发布了新的文献求助10
18秒前
18秒前
冷傲迎梦完成签到,获得积分10
20秒前
塵亦完成签到,获得积分10
20秒前
dde应助shinn采纳,获得10
20秒前
沐辰完成签到,获得积分10
21秒前
KOC发布了新的文献求助10
22秒前
大模型应助hhh采纳,获得10
22秒前
火星弟弟完成签到,获得积分10
23秒前
雪白安筠发布了新的文献求助10
23秒前
23秒前
24秒前
24秒前
24秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
The Resilient Mindset 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
Disturbing the Quiet Life? Competition and CEO Incentives 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6654382
求助须知:如何正确求助?哪些是违规求助? 8407618
关于积分的说明 17977135
捐赠科研通 5851042
什么是DOI,文献DOI怎么找? 2972283
邀请新用户注册赠送积分活动 1948057
关于科研通互助平台的介绍 1869116