计算机科学
聚类分析
整数规划
冗余(工程)
线性规划
集合(抽象数据类型)
任务(项目管理)
数据挖掘
理论计算机科学
数学优化
机器学习
算法
数学
管理
经济
程序设计语言
操作系统
作者
Luiz Henrique Nogueira Lorena,Marcos G. Quiles,Luiz Antônio Nogueira Lorena,André C. P. L. F. de Carvalho,Juliana Garcia Cespedes
标识
DOI:10.1109/ijcnn.2019.8851969
摘要
Qualitative data clustering is a fundamental data analysis task, with applications in many areas, like medicine, sociology, and economics. An appealing way to deal with this task is via Integer Linear Programming, as it avoids inappropriate inferences by the final user. This approach has two main advantages: the data are directly used, without the need of being converted to quantitative values, and the optimal number of clusters is automatically obtained by solving the optimization problem. However, it might create large and redundant models, which can limit the size of the problems it can be applied. Recently, models that are more compact and able to avoid some redundancy have been proposed in the literature. These models consume less memory and are faster to obtain the optimal solution set. In this study, a new model is introduced and compared with the state-of-the-art alternatives using datasets from different application domains. Empirical results show that the new model outperforms its predecessors, achieving the optimal solution set with lower computational time and memory consumption.
科研通智能强力驱动
Strongly Powered by AbleSci AI