聚类分析
计算机科学
子空间拓扑
约束(计算机辅助设计)
表达式(计算机科学)
利用
DNA甲基化
数据集成
代表(政治)
人工智能
秩(图论)
数据挖掘
张量(固有定义)
计算生物学
机器学习
理论计算机科学
作者
Xiaowei Gao,Xiaogang Liu,Xiaoke Ma
标识
DOI:10.1109/bibm52615.2021.9669285
摘要
The accumulated gene expression and DNA methylation data provide a great opportunity to exploit the mechanisms of biological systems. Current algorithms for the integration of gene expression and methylation data are characterized for undesirable performance because they fail to address the latent relations in the heterogeneous data. To solve this problem, we propose a novel multi-view clustering with self-representation learning and low-rank tensor constraint (MCSL-LTC), where the gene expression and DNA methylation data are treated as complementary views, and MCSL-LTC obtains a consensus partitioning reflecting the structure and features of various views. Specifically, self-representation learning is employed to explore the low-dimensional subspace structures embedded in different views, where the tensor norm is adopted to smooth different views, therefore improving the quality of features. Experimental results demonstrate that the proposed approach outperforms state-of-the-art baselines in terms of accuracy on both the social and cancer data, provides an effective and efficient method for the integration of heterogeneous genomic data.
科研通智能强力驱动
Strongly Powered by AbleSci AI