Fast LDP-MST: An Efficient Density-Peak-Based Clustering Method for Large-Size Datasets

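Keywords: Notation, Mathematics, Cluster analysis, Combinatorics, Discrete mathematics, Arithmetic, Statistics
Authors
Teng Qiu, Yongjie Li
Source
Journal: IEEE Transactions on Knowledge and Data Engineering [Institute of Electrical and Electronics Engineers]
Volume (Issue): 35 (5): 4767-4780 Cited by: 11
Identifier
DOI:10.1109/tkde.2022.3150403
Abstract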

Recently, a new density-peak-based clustering method, called clustering with local density peaks-based minimum spanning tree (LDP-MST), was proposed. It has several attractive merits, e.g., it can detect arbitrarily shaped clusters and is not very sensitive to noise and parameters. Nevertheless, we found that LDP-MST is limited in efficiency. Specifically, LDP-MST runs in $O(N\log N+M^{2})$ time, where $N$ denotes the dataset size and $M$ is an intermediate variable denoting the number of local density peaks. As our experimental results reveal, when processing large-size datasets, the value of $M$ can be very large, and consequently those steps of LDP-MST involving the $O(M^{2})$ time term become time-consuming. In the worst case, the value of $M$ can be very close to that of $N$, which means that the time complexity of LDP-MST can be $O(N^{2})$. In this study, we use more efficient algorithms to implement those steps of LDP-MST that involve the $O(M^{2})$ time term, such that the proposed method, Fast LDP-MST, has $O(N\log N)$ time complexity even if $M\approx N$. Our experiments demonstrate that Fast LDP-MST is overall more efficient than LDP-MST on large-size datasets, without sacrificing the merits of LDP-MST in effectiveness, robustness, and user-friendliness.
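
To make the complexity argument concrete, the following is a minimal sketch that compares the two asymptotic bounds quoted in the abstract as rough operation counts. It is not the authors' implementation and says nothing about actual running times: constant factors are ignored, the function names `ldp_mst_cost` and `fast_ldp_mst_cost` are hypothetical, and the values of $N$ and $M$ are made-up illustrations.

```python
import math


def ldp_mst_cost(n: int, m: int) -> float:
    """Illustrative operation count for the O(N log N + M^2) bound.

    Constants are ignored; this is a cost model, not a timing measurement.
    """
    return n * math.log2(n) + m ** 2


def fast_ldp_mst_cost(n: int) -> float:
    """Illustrative operation count for the O(N log N) bound (constants ignored)."""
    return n * math.log2(n)


if __name__ == "__main__":
    n = 1_000_000  # hypothetical dataset size N
    # Hypothetical numbers of local density peaks M, up to the worst case M = N.
    for m in (1_000, 100_000, n):
        ratio = ldp_mst_cost(n, m) / fast_ldp_mst_cost(n)
        print(f"N={n:,}  M={m:,}  modeled cost ratio ~ {ratio:,.1f}")
```

Under this simplified model the $M^{2}$ term is negligible while $M$ is small relative to $\sqrt{N\log N}$, and it dominates as $M$ approaches $N$, which is the regime the abstract identifies as the worst case.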