主成分分析
计算机科学
可扩展性
推论
计算生物学
数据挖掘
降维
单细胞测序
树(集合论)
比例(比率)
生物
人工智能
外显子组测序
遗传学
突变
数学
数据库
量子力学
基因
物理
数学分析
作者
Ziwei Chen,Fuzhou Gong,Lin Wan,Liang Ma
出处
期刊:Bioinformatics
[Oxford University Press]
日期:2020-03-06
卷期号:36 (11): 3299-3306
被引量:29
标识
DOI:10.1093/bioinformatics/btaa172
摘要
Abstract Motivation Single-cell sequencing (SCS) data provide unprecedented insights into intratumoral heterogeneity. With SCS, we can better characterize clonal genotypes and reconstruct phylogenetic relationships of tumor cells/clones. However, SCS data are often error-prone, making their computational analysis challenging. Results To infer the clonal evolution in tumor from the error-prone SCS data, we developed an efficient computational framework, termed RobustClone. It recovers the true genotypes of subclones based on the extended robust principal component analysis, a low-rank matrix decomposition method, and reconstructs the subclonal evolutionary tree. RobustClone is a model-free method, which can be applied to both single-cell single nucleotide variation (scSNV) and single-cell copy-number variation (scCNV) data. It is efficient and scalable to large-scale datasets. We conducted a set of systematic evaluations on simulated datasets and demonstrated that RobustClone outperforms state-of-the-art methods in large-scale data both in accuracy and efficiency. We further validated RobustClone on two scSNV and two scCNV datasets and demonstrated that RobustClone could recover genotype matrix and infer the subclonal evolution tree accurately under various scenarios. In particular, RobustClone revealed the spatial progression patterns of subclonal evolution on the large-scale 10X Genomics scCNV breast cancer dataset. Availability and implementation RobustClone software is available at https://github.com/ucasdp/RobustClone. Contact lwan@amss.ac.cn or maliang@ioz.ac.cn Supplementary information Supplementary data are available at Bioinformatics online.
科研通智能强力驱动
Strongly Powered by AbleSci AI