聚类分析
模糊聚类
可扩展性
计算机科学
相似性(几何)
模糊逻辑
人工智能
机器学习
代表(政治)
数据挖掘
图形
离群值
模式识别(心理学)
理论计算机科学
数据库
政治
政治学
法学
图像(数学)
作者
Bingbing Jiang,Chenglong Zhang,Zhongli Wang,Xinyan Liang,Peng Zhou,Liang Du,Qinghua Zhang,Weiping Ding,Yi Liu
标识
DOI:10.1109/tfuzz.2025.3581679
摘要
To partition samples into distinct clusters, Fuzzy C-Means (FCM) calculates the membership degrees of samples to cluster centers and provides soft labels, gaining significant attention in recent years. However, existing FCM methods encounter the following challenges. First, traditional FCM focuses on learning membership degrees, neglecting the data similarity structures. Second, graph-based FCM typically separates graph construction from clustering, overlooking the knowledge interaction between graphs and clustering, obtaining suboptimal performance. Third, exploring the similarity structures among all samples is computationally expensive for large-scale tasks. To solve these dilemmas, we propose a scalable fuzzy clustering with collaborative structure learning and preservation (CSLP), which simultaneously leverages both cluster information and similarity structures to learn an optimal membership degree representation. Specifically, a self-weighted manner is devised to measure the sample importance, thereby reducing the adverse impacts of outliers. Moreover, the graph is updated according to the data similarities in the membership degree representation, such that CSLP collaboratively learns the graph and membership degrees in a mutually reinforcing manner. Thus, the similarity structures are fully explored during clustering processes and preserved in the learned membership degrees, enhancing the discrimination of clustering labels. To further improve efficiency, an acceleration solution is developed to reduce the computational cost of CSLP by propagating membership degrees from potential centers to samples, making CSLP scalable for large-scale tasks. An iterative strategy is designed to solve the formulated objective function. Extensive experiments demonstrate that CSLP outperforms other fuzzy clustering methods in terms of both effectiveness and scalability.
科研通智能强力驱动
Strongly Powered by AbleSci AI