插补(统计学)
聚类分析
数据挖掘
计算机科学
计算生物学
生物
人工智能
缺少数据
机器学习
作者
Mostafa Eltager,Tamim Abdelaal,Ahmed Mahfouz,Marcel J. T. Reinders
标识
DOI:10.1101/2021.02.24.432644
摘要
Abstract Motivation Single-cell multi-omics assays simultaneously measure different molecular features from the same cell. A key question is how to benefit from the complementary data available and perform cross-modal clustering of cells. Results We propose S ingle- C ell M ulti- o mics C lustering (scMoC), an approach to identify cell clusters from data with co-measurements of scRNA-seq and scATAC-seq from the same cell. We overcome the high sparsity of the scATAC-seq data by using an imputation strategy that exploits the less-sparse scRNA-seq data available from the same cell. Subsequently, scMoC identifies clusters of cells by merging clusterings derived from both data domains individually. We tested scMoC on datasets generated using different protocols with variable data sparsity levels. We show that, due to its imputation scheme, scMoC 1) is able to generate informative scATAC-seq data due to its RNA guided imputation strategy, and 2) results in integrated clusters based on both RNA and ATAC information that are biologically meaningful either from the RNA or from the ATAC perspective. Availability The code is freely available at: https://github.com/meltager/scmoc . Supplementary information Supplementary data are available online.
科研通智能强力驱动
Strongly Powered by AbleSci AI