非负矩阵分解
反褶积
矩阵分解
计算机科学
分类
理论(学习稳定性)
约束(计算机辅助设计)
生物信息学
表达式(计算机科学)
基质(化学分析)
数据挖掘
模式识别(心理学)
算法
计算生物学
人工智能
数学
基因
机器学习
生物
化学
遗传学
特征向量
物理
几何学
量子力学
色谱法
程序设计语言
作者
M. D. Sweeney,Luke A. Torre-Healy,Virginia Acha,Margaret A Hall,Lucie Chrastecka,Alisa Yurovsky,Richard A. Moffitt
标识
DOI:10.1109/tcbb.2023.3296979
摘要
Gene expression analysis of samples with mixed cell types only provides limited insight to the characteristics of specific tissues. In silico deconvolution can be applied to extract cell type specific expression, thus avoiding prohibitively expensive techniques such as cell sorting or single-cell sequencing. Non-negative matrix factorization (NMF) is a deconvolution method shown to be useful for gene expression data, in part due to its constraint of non-negativity. Unlike other methods, NMF provides the capability to deconvolve without prior knowledge of the components of the model. However, NMF is not guaranteed to provide a globally unique solution. In this work, we present FaStaNMF, a method that balances achieving global stability of the NMF results, which is essential for inter-experiment and inter-lab reproducibility, with accuracy and speed. Results: FaStaNMF was applied to four datasets with known ground truth, created based on publicly available data or by using our simulation infrastructure, RNAGinesis. We assessed FaStaNMF on three criteria - speed, accuracy, and stability, and it favorably compared to the standard approach of achieving reproduceable results with NMF. We expect that FaStaNMF can be applied successfully to a wide array of biological data, such as different tumor/immune and other disease microenvironments.
科研通智能强力驱动
Strongly Powered by AbleSci AI