聚类分析
邻接矩阵
高斯分布
光谱聚类
算法
计算机科学
确定数据集中的群集数
混合模型
密度估算
概率密度函数
数据挖掘
模式识别(心理学)
邻接表
数学
图形
人工智能
相关聚类
树冠聚类算法
理论计算机科学
统计
物理
估计员
量子力学
作者
Guanli Yue,Ansheng Deng,Yanpeng Qu,Hui Cui,Xueying Wang
标识
DOI:10.1016/j.ins.2023.03.067
摘要
Spectral clustering aims to minimise inter-cluster similarity by constructing graph model, which possesses a significant effect in data of arbitrary shape. Nonetheless, there are still two limitations in the existing algorithms. First, spectral methods perform poorly if the densities of instances vary greatly. Second, they are challenging to handle the complex distribution that different objects of densities are mixed. To address the two limitations, in this paper, a novel spectral clustering algorithm is constructed to accommodate more complex multi-density data. The idea of density stratification is proposed through the probability density of Gaussian mixture model, and the number of layers can be automatically determined by the defined scatter index. Then, a new density ratio is established to effectively reduce the density imbalance of diverse instances based on mutual nearest-neighbour and density stratification, which is more advantageous in the clusters with less clear boundaries. Finally, the optimised adjacency matrix and multi-density spectral clustering algorithm are induced to improve the effect of multi-density data. As per experimental results, the proposed algorithm generally outperforms the popular representative algorithms for both synthetic and benchmark datasets, and works more effectively on multi-density as well as single-density data, which illustrates the superiority of the proposed algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI