染色质
计算生物学
DNA甲基化
生物
表观遗传学
表观遗传学
计算机科学
基因
甲基化
差异甲基化区
基因表达
表观基因组
基因组
遗传学
R包
组蛋白
生物导体
作者
Jiaxuan Wangwu,Zexuan Sun,Zhixiang Lin
出处
期刊:Bioinformatics
[Oxford University Press]
日期:2021-06-04
卷期号:37 (21): 3874-3880
被引量:1
标识
DOI:10.1093/bioinformatics/btab426
摘要
MOTIVATION The advancement in technologies and the growth of available single-cell datasets motivate integrative analysis of multiple single-cell genomic datasets. Integrative analysis of multimodal single-cell datasets combines complementary information offered by single-omic datasets and can offer deeper insights on complex biological process. Clustering methods that identify the unknown cell types are among the first few steps in the analysis of single-cell datasets, and they are important for downstream analysis built upon the identified cell types. RESULTS We propose scAMACE for the integrative analysis and clustering of single-cell data on chromatin accessibility, gene expression and methylation. We demonstrate that cell types are better identified and characterized through analyzing the three data types jointly. We develop an efficient expectationmaximization (EM) algorithm to perform statistical inference, and evaluate our methods on both simulation study and real data applications. We also provide the GPU implementation of scAMACE, making it scalable to large datasets. AVAILABILITY The software and datasets are available at https://github.com/cuhklinlab/scAMACE_py (python implementation) and https://github.com/cuhklinlab/scAMACE (R implementation). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
科研通智能强力驱动
Strongly Powered by AbleSci AI