过采样
数据库扫描
计算机科学
聚类分析
随机性
噪音(视频)
集合(抽象数据类型)
水准点(测量)
模式识别(心理学)
人工智能
算法
数据挖掘
数学
相关聚类
统计
CURE数据聚类算法
图像(数学)
带宽(计算)
地理
程序设计语言
计算机网络
大地测量学
作者
Eshan Bajal,Vipin Katara,Madhulika Bhatia,Madhurima Hooda
标识
DOI:10.2174/1872212115666210208222231
摘要
Abstract: The two most widely used and easily implementable algorithm for clustering and classification-based analysis of data in the unsupervised learning domain are Density-Based Spatial Clustering of Applications with Noise and K-mean cluster analysis. These two techniques can handle most cases effective when the data has a lot of randomness with no clear set to use as a parameter as in case of linear or logistic regression algorithms. However few papers exist that pit these two against each other in a controlled environment to observe which one reigns supreme and conditions required for the same. In this paper, a renal adenocarcinoma dataset is analyzed and thereafter both DBSCAN and K-mean are applied on the dataset with subsequent examination of the results. The efficacy of both the techniques in this study is compared and based on them the merits and demerits observed are enumerated. Further, the interaction of t-SNE with the generated clusters are explored.
科研通智能强力驱动
Strongly Powered by AbleSci AI