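Cluster analysis
Computer science
Hierarchical clustering
Data mining
Mixture model
Context (archaeology)
Cluster (spacecraft)
Set (abstract data type)
Data set
Basis (linear algebra)
Artificial intelligence
Machine learning
Mathematics
Paleontology
Geometry
Biology
Programming language
Identifier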
DOI:10.1177/096228029200100103
Abstract
In this paper we review methods of cluster analysis in the context of classifying patients on the basis of clinical and/or laboratory-type observations. Both hierarchical and non-hierarchical methods of clustering are considered, although the emphasis is on the latter type, with particular attention devoted to the mixture likelihood-based approach. For the purposes of dividing a given data set into g clusters, this approach fits a mixture model of g components, using the method of maximum likelihood. It thus provides a sound statistical basis for clustering. The important but difficult question of how many clusters there are in the data can be addressed within the framework of standard statistical theory, although theoretical and computational difficulties still remain. Two case studies, involving the cluster analysis of some haemophilia and diabetes data respectively, are reported to demonstrate the mixture likelihood-based approach to clustering.
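To make the mixture likelihood-based approach concrete, the following is a minimal sketch of fitting a g-component mixture by maximum likelihood and then assigning each observation to the component with the highest posterior probability. It rests on several assumptions that are not taken from the abstract: the components are univariate normal, the likelihood is maximised with the EM algorithm, the starting means are sample quantiles, and the data are synthetic (they are not the haemophilia or diabetes data). The function names `normal_pdf`, `posterior_probs`, and `fit_mixture` are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, var):
    """Density of a univariate normal distribution with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def posterior_probs(data, weights, means, variances):
    """Posterior probability of each component for each observation (E-step)."""
    resp = []
    for x in data:
        dens = [w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)]
        total = sum(dens)
        resp.append([d / total for d in dens])
    return resp


def fit_mixture(data, g, n_iter=200):
    """Fit a g-component univariate normal mixture by EM (assumed setup, see lead-in)."""
    n = len(data)
    ordered = sorted(data)
    # Assumption: start the means at evenly spaced sample quantiles.
    means = [ordered[int((k + 0.5) * n / g)] for k in range(g)]
    grand_mean = sum(data) / n
    grand_var = sum((x - grand_mean) ** 2 for x in data) / n
    variances = [grand_var] * g
    weights = [1.0 / g] * g

    for _ in range(n_iter):
        resp = posterior_probs(data, weights, means, variances)
        # M-step: update mixing proportions, means, and variances.
        for k in range(g):
            nk = sum(r[k] for r in resp)
            weights[k] = nk / n
            means[k] = sum(r[k] * x for r, x in zip(resp, data)) / nk
            var = sum(r[k] * (x - means[k]) ** 2 for r, x in zip(resp, data)) / nk
            variances[k] = max(var, 1e-6)  # guard against a collapsing component

    # Log-likelihood of the data under the fitted mixture.
    loglik = sum(
        math.log(sum(w * normal_pdf(x, m, v) for w, m, v in zip(weights, means, variances)))
        for x in data
    )
    return weights, means, variances, loglik


# Synthetic illustration: two groups of 100 observations each.
rng = random.Random(0)
data = [rng.gauss(0.0, 1.0) for _ in range(100)] + [rng.gauss(5.0, 1.0) for _ in range(100)]

weights, means, variances, loglik = fit_mixture(data, g=2)
resp = posterior_probs(data, weights, means, variances)
# Each observation goes to the component with the largest posterior probability.
labels = [max(range(2), key=lambda k: r[k]) for r in resp]
```

The maximised log-likelihood returned for different values of g is the quantity on which likelihood-based comparisons of the number of clusters would be built; the sketch does not attempt that comparison.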