计算机科学
朴素贝叶斯分类器
班级(哲学)
条件独立性
数据挖掘
人工智能
机器学习
估计员
光学(聚焦)
冗余(工程)
支持向量机
数学
统计
物理
光学
操作系统
作者
Huan Zhang,Liangxiao Jiang,Wenjun Zhang,Chaoqun Li
标识
DOI:10.1109/tkde.2022.3177634
摘要
Naive Bayes (NB) continues to be one of the top 10 data mining algorithms due to its simplicity, efficiency and efficacy. Numerous enhancements have been proposed to weaken its attribute conditional independence assumption. However, all of them only focus on the raw attribute view, which is hard to reflect all the data characteristics in real-world applications. To portray data characteristics more comprehensively, in this study, we construct two label views from the raw attributes and propose a novel model called multi-view attribute weighted naive Bayes (MAWNB). In MAWNB, we first build multiple super-parent one-dependence estimators (SPODEs) as well as random trees (RTs), then we utilize each of them to classify each training instance in turn and use all their predicted class labels to construct two label views. Next, to avoid attribute redundancy, we optimize the weight of each attribute value for each class by minimizing the negative conditional log-likelihood (CLL) in each view. Finally, the estimated class-membership probabilities by three views are fused to predict the class label for each test instance. Extensive experiments show that MAWNB significantly outperforms NB and all the other existing state-of-the-art competitors.
科研通智能强力驱动
Strongly Powered by AbleSci AI