聚类分析
计算机科学
概念聚类
共识聚类
模糊聚类
透视图(图形)
双聚类
人工智能
数据挖掘
高维数据聚类
机器学习
情报检索
CURE数据聚类算法
数据科学
作者
Anil K. Jain,M. Narasimha Murty,Patrick J. Flynn
出处
期刊:ACM Computing Surveys
[Association for Computing Machinery]
日期:1999-09-01
卷期号:31 (3): 264-323
被引量:13086
标识
DOI:10.1145/331499.331504
摘要
Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). The clustering problem has been addressed in many contexts and by researchers in many disciplines; this reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However, clustering is a difficult problem combinatorially, and differences in assumptions and contexts in different communities has made the transfer of useful generic concepts and methodologies slow to occur. This paper presents an overview of pattern clustering methods from a statistical pattern recognition perspective, with a goal of providing useful advice and references to fundamental concepts accessible to the broad community of clustering practitioners. We present a taxonomy of clustering techniques, and identify cross-cutting themes and recent advances. We also describe some important applications of clustering algorithms such as image segmentation, object recognition, and information retrieval.
科研通智能强力驱动
Strongly Powered by AbleSci AI