Density Peak Clustering with connectivity estimation

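Cluster analysis; Euclidean distance; Computer science; Cluster (spacecraft); Graph; Point (geometry); Euclidean geometry; Data mining; Similarity (geometry); Algorithm; Pattern recognition (psychology); Artificial intelligence; Mathematics; Theoretical computer science; Programming language; Geometry; Image (mathematics)
Authors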
Wenjie Guo,Wenhai Wang,Shunping Zhao,Yunlong Niu,Zeyin Zhang,Xinggao Liu
Source
Journal: Knowledge-Based Systems [Elsevier BV]
Volume/Issue: 243: 108501-108501 Citations: 85
Identifier
DOI:10.1016/j.knosys.2022.108501
Abstract

In 2014, a novel clustering algorithm called Density Peak Clustering (DPC) was proposed in the journal Science, and it has received great attention in many fields due to its simplicity and effectiveness. However, empirical studies have demonstrated that DPC has two main deficiencies: 1. It is very hard to identify the true cluster centers in the decision graph provided by DPC, especially when handling clusters with non-spherical shapes and non-uniform densities; 2. The performance of DPC is significantly affected by the ‘chain reaction’, i.e., an incorrect assignment of the point with the highest density in a region will lead all points in that region to the same wrong cluster. To address these two deficiencies, density peak clustering with connectivity estimation (DPC-CE) is presented. In the improved algorithm, points with higher relative distance are chosen as local centers for further calculation. Then a graph-based strategy is proposed to estimate the connectivity information between local centers. With the estimated information, a distance punishment that considers both Euclidean distance and connectivity information is applied to reassess the similarity between local centers. By adding connectivity information to the distance calculation, DPC-CE not only ensures that the true cluster centers stand out in the decision graph, but also assigns all local centers correctly, even on clusters with arbitrary shapes and non-uniform densities. Because of the ‘chain reaction’ discussed above, those local centers then lead all points around them to the right cluster. Experimental results on 14 synthetic datasets and 10 real-world datasets demonstrate the effectiveness and robustness of DPC-CE in terms of three evaluation metrics.
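
For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of the original DPC procedure, not of DPC-CE itself (the abstract does not give enough detail to reproduce the connectivity estimation or the distance punishment). It computes a local density and a relative distance for every point, ranks points by their product as a stand-in for reading the decision graph, and then assigns each remaining point to the cluster of its nearest higher-density neighbor. The function name `dpc_baseline`, the Gaussian density kernel, the cutoff parameter `d_c`, and the choice of passing `n_clusters` explicitly are assumptions made for illustration.

```python
import numpy as np

def dpc_baseline(X, d_c, n_clusters):
    """Illustrative baseline DPC (assumed parameterization, not DPC-CE)."""
    n = len(X)
    # Pairwise Euclidean distances between all points.
    diff = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    # Local density via a Gaussian kernel (an assumed choice);
    # subtract 1 to remove each point's contribution to itself.
    rho = np.exp(-(dist / d_c) ** 2).sum(axis=1) - 1.0

    # Visit points from highest to lowest density.
    order = np.argsort(-rho, kind="stable")
    delta = np.zeros(n)            # relative distance
    parent = np.full(n, -1)        # nearest point with higher density
    for rank, i in enumerate(order):
        if rank == 0:
            # The densest point has no higher-density neighbor.
            delta[i] = dist[i].max()
            continue
        higher = order[:rank]
        j = higher[np.argmin(dist[i, higher])]
        delta[i] = dist[i, j]
        parent[i] = j

    # Points with large rho * delta stand out in the decision graph.
    gamma = rho * delta
    centers = np.argsort(-gamma, kind="stable")[:n_clusters]
    if order[0] not in centers:
        # Guard for ties: the densest point must always be a center.
        centers[-1] = order[0]

    labels = np.full(n, -1)
    labels[centers] = np.arange(n_clusters)
    # Each non-center inherits the label of its parent. A parent always
    # has higher density, so it is labeled first. This inheritance is
    # the 'chain reaction': one wrong label propagates downstream.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels, rho, delta
```

On two well-separated compact blobs this sketch is expected to recover one cluster per blob. The deficiencies listed in the abstract concern harder cases, where clusters are non-spherical or differ in density and the product of density and relative distance no longer singles out the true centers.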