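Cluster analysis
Computer science
Artificial intelligence
Feature (linguistics)
Constrained clustering
Projection (relational algebra)
Clustering high-dimensional data
Correlation clustering
Data mining
Pattern recognition (psychology)
Consensus clustering
Biclustering
CURE data clustering algorithm
Machine learning
Algorithm
Linguistics
Philosophy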
Authors
Qiyue Yin,Shu Wu,Liang Wang
Identifier
DOI:10.1145/2806416.2806526
Abstract
Multi-view clustering, which exploits complementary information across multiple distinct feature sets to improve clustering, has a wide range of applications, e.g., knowledge management and information retrieval. Traditional multi-view clustering methods usually assume that all examples have complete feature sets. In real applications, however, some examples are often missing some feature sets, which results in incomplete multi-view data and notable performance degradation. In this paper, a novel incomplete multi-view clustering method is therefore developed, which learns unified latent representations and projection matrices for the incomplete multi-view data. To approximate the high-level scaled indicator matrix, which is defined to represent the class label matrix, the latent representations are expected to be non-negative and column orthogonal. In addition, since data often have high-dimensional and noisy features, the projection matrices are enforced to be sparse so as to select relevant features when learning the latent space. Furthermore, the inter-view and intra-view data structure is preserved to further enhance the clustering performance. To these ends, an objective is developed with an efficient optimization strategy and convergence analysis. Extensive experiments demonstrate that the model performs better than state-of-the-art multi-view clustering methods in various settings.
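The abstract asks the latent representations to be non-negative and column orthogonal because a scaled indicator matrix has exactly those two properties. The sketch below illustrates why. It is not the paper's algorithm. It assumes the commonly used definition F = Y (Y^T Y)^(-1/2), where Y is the one-hot class label matrix; the abstract does not state the formula. The function names `scaled_indicator` and `gram` and the example labels are hypothetical.

```python
import math


def scaled_indicator(labels, n_clusters):
    """Build a scaled indicator matrix F = Y (Y^T Y)^(-1/2) (assumed definition).

    Y is the one-hot label matrix. Y^T Y is diagonal and holds the cluster
    sizes, so each column of Y is divided by the square root of its size.
    """
    sizes = [0] * n_clusters
    for c in labels:
        sizes[c] += 1
    if any(s == 0 for s in sizes):
        raise ValueError("every cluster needs at least one example")
    return [
        [1.0 / math.sqrt(sizes[c]) if c == k else 0.0 for k in range(n_clusters)]
        for c in labels
    ]


def gram(F):
    """Return F^T F as a nested list."""
    n_cols = len(F[0])
    return [
        [sum(row[i] * row[j] for row in F) for j in range(n_cols)]
        for i in range(n_cols)
    ]


labels = [0, 0, 1, 2, 2, 2]  # hypothetical cluster assignments for six examples
F = scaled_indicator(labels, n_clusters=3)
G = gram(F)  # approximately the 3x3 identity, i.e. the columns are orthonormal
```

Every entry of `F` is non-negative, and `G` equals the identity up to floating-point error. A latent representation constrained in the same way can therefore be read as a soft version of a cluster assignment.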