计算机科学
矩阵分解
代表(政治)
非负矩阵分解
聚类分析
低秩近似
双聚类
模式识别(心理学)
数据挖掘
基质(化学分析)
秩(图论)
人工智能
数学
模糊聚类
CURE数据聚类算法
特征向量
量子力学
物理
组合数学
化学
数学分析
色谱法
汉克尔矩阵
政治
政治学
法学
作者
Wei Zhang,Xiaoli Xue,Xiaoying Zheng,Zizhu Fan
标识
DOI:10.1109/jbhi.2021.3099127
摘要
Fast-developing single-cell technologies create unprecedented opportunities to reveal cell heterogeneity and diversity. Accurate classification of single cells is a critical prerequisite for recovering the mechanisms of heterogeneity. However, the scRNA-seq profiles we obtained at present have high dimensionality, sparsity, and noise, which pose challenges for existing clustering methods in grouping cells that belong to the same subpopulation based on transcriptomic profiles. Although many computational methods have been proposed developing novel and effective computational methods to accurately identify cell types remains a considerable challenge. We present a new computational framework to identify cell types by integrating low-rank representation (LRR) and nonnegative matrix factorization (NMF); this framework is named NMFLRR. The LRR captures the global properties of original data by using nuclear norms, and a locality constrained graph regularization term is introduced to characterize the data's local geometric information. The similarity matrix and low-dimensional features of data can be simultaneously obtained by applying the alternating direction method of multipliers (ADMM) algorithm to handle each variable alternatively in an iterative way. We finally obtained the predicted cell types by using a spectral algorithm based on the optimized similarity matrix. Nine real scRNA-seq datasets were used to test the performance of NMFLRR and fifteen other competitive methods, and the accuracy and robustness of the simulation results suggest the NMFLRR is a promising algorithm for the classification of single cells. The simulation code is freely available at: https://github.com/wzhangwhu/NMFLRR_code.
科研通智能强力驱动
Strongly Powered by AbleSci AI