聚类分析
选择(遗传算法)
数学
稳健性(进化)
确定数据集中的群集数
星团(航天器)
理论(学习稳定性)
计算机科学
数据挖掘
相关聚类
CURE数据聚类算法
统计
人工智能
机器学习
基因
生物化学
化学
程序设计语言
作者
Yixin Fang,Junhui Wang
标识
DOI:10.1016/j.csda.2011.09.003
摘要
Here the problem of selecting the number of clusters in cluster analysis is considered. Recently, the concept of clustering stability, which measures the robustness of any given clustering algorithm, has been utilized in Wang (2010) for selecting the number of clusters through cross validation. In this paper, an estimation scheme for clustering instability is developed based on the bootstrap, and then the number of clusters is selected so that the corresponding estimated clustering instability is minimized. The proposed selection criterion’s effectiveness is demonstrated on simulations and real examples.
科研通智能强力驱动
Strongly Powered by AbleSci AI