聚类分析
数学
统计
航程(航空)
数据集
集合(抽象数据类型)
启发式
计算机科学
数学优化
复合材料
材料科学
程序设计语言
作者
W. J. Krzanowski,Y.-T. Lai
出处
期刊:Biometrics
[Oxford University Press]
日期:1988-03-01
卷期号:44 (1): 23-23
被引量:700
摘要
Marriott (1971, Biometrics 27, 501-514) used a heuristic argument to derive the criterion g2 l W I for determining the number of groups in a data set when the clustering objective function is the withingroup determinant I W 1. An analogous argument is employed to derive a criterion for use with the within-group sum-of-squares objective function trace (W). The behaviour of both Marriott's criterion and the new criterion is investigated by Monte Carlo methods. For homogeneous data based on uniform and independent variables, the performance of the new criterion is close to expectation while Marriott's criterion shows much more extreme behaviour. For grouped data, the new criterion correctly identifies the number of groups in 85% of data sets under a wide range of conditions, while Marriott's criterion shows a success rate of less than 40%. The new criterion is illustrated on the wellknown Iris data, and some cautionary comments are made about its use.
科研通智能强力驱动
Strongly Powered by AbleSci AI