MNIST数据库
聚类分析
光谱聚类
核(代数)
维数(图论)
数学
比例(比率)
趋同(经济学)
模式识别(心理学)
数据点
核方法
算法
计算机科学
人工智能
支持向量机
组合数学
物理
人工神经网络
量子力学
经济增长
经济
作者
Li He,Nilanjan Ray,Yisheng Guan,Hong Zhang
标识
DOI:10.1109/tcyb.2018.2794998
摘要
We propose an efficient spectral clustering method for large-scale data. The main idea in our method consists of employing random Fourier features to explicitly represent data in kernel space. The complexity of spectral clustering thus is shown lower than existing Nyström approximations on large-scale data. With m training points from a total of n data points, Nyström method requires O(nmd+m3+nm2) operations, where d is the input dimension. In contrast, our proposed method requires O(nDd+D3+n'D2) , where n' is the number of data points needed until convergence and D is the kernel mapped dimension. In large-scale datasets where n' << n hold true, our explicitly mapping method can significantly speed up eigenvector approximation and benefit prediction speed in spectral clustering. For instance, on MNIST (60 000 data points), the proposed method is similar in clustering accuracy to Nyström methods while its speed is twice as fast as Nyström.
科研通智能强力驱动
Strongly Powered by AbleSci AI