亲缘关系
聚类分析
样品(材料)
张量(固有定义)
计算机科学
数学
人工智能
化学
色谱法
纯数学
立体化学
作者
Hongmin Cai,Fei Qi,Junyu Li,Yu Hu,Bin Hu,Yue Zhang,Yiu‐ming Cheung
标识
DOI:10.1109/tnnls.2024.3439545
摘要
Traditional clustering methods rely on pairwise affinity to divide samples into different subgroups. However, high-dimensional small-sample (HDLSS) data are affected by the concentration effects, rendering traditional pairwise metrics unable to accurately describe relationships between samples, leading to suboptimal clustering results. This article advances the proposition of employing high-order affinities to characterize multiple sample relationships as a strategic means to circumnavigate the concentration effects. We establish a nexus between different order affinities by constructing specialized decomposable high-order affinities, thereby formulating a uniform mathematical framework. Building upon this insight, a novel clustering method named uniform tensor clustering (UTC) is proposed, which learns a consensus low-dimensional embedding for clustering by the synergistic exploitation of multiple-order affinities. Extensive experiments on synthetic and real-world datasets demonstrate two findings: 1) high-order affinities are better suited for characterizing sample relationships in complex data and 2) reasonable use of different order affinities can enhance clustering effectiveness, especially in handling high-dimensional data.
科研通智能强力驱动
Strongly Powered by AbleSci AI