Density Peak Clustering with connectivity estimation

聚类分析 欧几里德距离 计算机科学 星团(航天器) 图形 点(几何) 欧几里德几何 数据挖掘 相似性(几何) 算法 模式识别(心理学) 人工智能 数学 理论计算机科学 程序设计语言 几何学 图像(数学)
作者
Wenjie Guo,Wenhai Wang,Shunping Zhao,Yunlong Niu,Zeyin Zhang,Xinggao Liu
出处
期刊:Knowledge Based Systems [Elsevier BV]
卷期号:243: 108501-108501 被引量:85
标识
DOI:10.1016/j.knosys.2022.108501
摘要

In 2014, a novel clustering algorithm called Density Peak Clustering (DPC) was proposed in journal Science, which has received great attention in many fields due to its simplicity and effectiveness. However, empirical studies have demonstrated that DPC has two main deficiencies: 1. It is very hard to identify the true cluster centers in the decision graph provided by DPC, especially when handling clusters with non-spherical shapes and non-uniform densities; 2. The performance of DPC is significantly affected by the ‘chain reaction’, i.e., an incorrect assignment of the point with the highest density of a region will lead all points in this region to the same wrong cluster. To address these two deficiencies, a density peak clustering with connectivity estimation (DPC”–CE) is presented. In the improved algorithm, points with higher relative distance are chosen as local centers for further calculation. Then a graph-based strategy is proposed to estimate the connectivity information between local centers. With the estimated information, a distance punishment which considers both Euclidean distance and connectivity information is further applied to reassess the similarity between local centers. By adding connectivity information into distance calculation, DPC-CE can not only ensure the true cluster centers can stand out in the decision graph, but also assign all local centers correctly, even on clusters with arbitrary shapes and non-uniform densities. And because of the ‘chain reaction’ we discussed above, those local centers will further lead all points around them to the right cluster. Experimental results on 14 synthetic datasets and 10 read-world datasets demonstrate the effectiveness and robustness of DPC”–CE in terms of three evaluation metrics.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
lk完成签到,获得积分10
2秒前
2秒前
深情安青应助lehua采纳,获得10
3秒前
林林l完成签到,获得积分10
3秒前
yyyy发布了新的文献求助10
5秒前
6秒前
7秒前
Regsey完成签到,获得积分20
7秒前
欧米伽发布了新的文献求助10
8秒前
8秒前
zhdjj发布了新的文献求助10
8秒前
9秒前
打工羊完成签到,获得积分10
9秒前
haui完成签到,获得积分10
9秒前
9秒前
9秒前
优雅的千凝完成签到,获得积分10
10秒前
大模型应助彩色的电脑采纳,获得10
12秒前
lyy完成签到,获得积分10
13秒前
机智的飞鸟完成签到 ,获得积分10
13秒前
wy发布了新的文献求助10
13秒前
在水一方应助不喝汽水采纳,获得10
13秒前
13秒前
13秒前
shuang发布了新的文献求助10
14秒前
zz完成签到,获得积分10
14秒前
garvey发布了新的文献求助10
15秒前
17秒前
66完成签到,获得积分10
17秒前
斯文败类应助LXN采纳,获得10
17秒前
乌拉挂机完成签到,获得积分10
17秒前
18秒前
wansc完成签到,获得积分10
19秒前
19秒前
19秒前
JamesPei应助青菜拌洋葱采纳,获得10
20秒前
青青发布了新的文献求助10
22秒前
852应助蓝天采纳,获得10
22秒前
壮观人达发布了新的文献求助10
22秒前
高分求助中
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
Climate change and sports: Statistics report on climate change and sports 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
Organic Reactions Volume 118 400
A Foreign Missionary on the Long March: The Unpublished Memoirs of Arnolis Hayman of the China Inland Mission 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6466799
求助须知:如何正确求助?哪些是违规求助? 8273127
关于积分的说明 17639885
捐赠科研通 5541883
什么是DOI,文献DOI怎么找? 2908026
邀请新用户注册赠送积分活动 1884980
关于科研通互助平台的介绍 1733225