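Genetic programming
Class (philosophy)
Computer science
Artificial intelligence
Baseline (sea)
Machine learning
One-class classification
Pattern recognition (psychology)
Genetic algorithm
Boundary (topology)
Data mining
Decision boundary
Support vector machine
Mathematics
Mathematical analysis
Oceanography
Geology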
Authors
Wenbin Pei,Bing Xue,Lin Shang,Jun Zhang
Identifier
DOI:10.1145/3449639.3459284
Abstract
In classification, when class overlap is intertwined with the issue of class imbalance, it is often challenging to discover useful patterns because of an ambiguous boundary between the majority class and the minority class. This becomes more difficult if the data is high-dimensional. To date, very few pieces of work have investigated how the class overlap issue can be effectively addressed or alleviated in classification with high-dimensional unbalanced data. In this paper, we propose a new genetic programming based method, which is able to automatically and directly detect borderline instances, in order to address the class overlap issue in classification with high-dimensional unbalanced data. In the proposed method, each individual has two trees to be trained together based on different classification rules. The proposed method is examined and compared with baseline methods on high-dimensional unbalanced datasets. Experimental results show that the proposed method achieves better classification performance than the baseline methods in almost all cases.
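The abstract states only that each individual holds two trees trained together under different classification rules; it does not give the rules themselves. The following is a minimal sketch of that two-tree representation, not the authors' method. Everything concrete in it is an assumption made for illustration: the nested-tuple tree encoding, the function set `OPS`, the `Individual` class, the sign-based labelling rule, and the choice to flag an instance as borderline when the two trees disagree in sign.

```python
import operator
from dataclasses import dataclass

# Toy function set (an assumption; the abstract does not list one).
OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul}


def evaluate(tree, x):
    """Evaluate a nested-tuple expression tree on the feature vector x."""
    if isinstance(tree, tuple):
        op, left, right = tree
        return OPS[op](evaluate(left, x), evaluate(right, x))
    if isinstance(tree, str):
        # Terminal such as "x0": read the feature at that index.
        return x[int(tree[1:])]
    # Otherwise the node is a numeric constant.
    return tree


@dataclass
class Individual:
    """Hypothetical individual carrying two trees, as the abstract describes."""
    tree_a: object
    tree_b: object

    def predict(self, x):
        """Return (label, is_borderline) for one instance.

        Assumed rules, for illustration only:
        - tree_a's sign picks the class label;
        - the instance is flagged borderline when the two trees
          disagree in sign.
        """
        a = evaluate(self.tree_a, x)
        b = evaluate(self.tree_b, x)
        label = "minority" if a > 0 else "majority"
        borderline = (a > 0) != (b > 0)
        return label, borderline


# Hand-written trees standing in for evolved ones.
ind = Individual(
    tree_a=("-", "x0", 1.0),               # x0 - 1.0
    tree_b=("-", ("*", "x0", "x1"), 2.0),  # x0 * x1 - 2.0
)

for instance in ([3.0, 1.0], [1.5, 1.0], [0.5, 1.0]):
    print(instance, ind.predict(instance))
```

In a full genetic programming run, both trees would be evolved jointly by a fitness function suited to unbalanced data; that part is omitted here because the abstract does not specify it.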