非负矩阵分解
秩(图论)
采样(信号处理)
数学
算法
矩阵分解
集合(抽象数据类型)
因式分解
基质(化学分析)
计算机科学
组合数学
物理
程序设计语言
材料科学
复合材料
特征向量
滤波器(信号处理)
量子力学
计算机视觉
作者
Ragnhild Laursen,Asger Hobolth
摘要
Nonnegative matrix factorization (NMF) is a useful method to extract features from multivariate data, but an important and sometimes neglected concern is that NMF can result in nonunique solutions. Often, there exist a set of feasible solutions (SFS), which makes it more difficult to interpret the factorization. This problem is especially ignored in cancer genomics, where NMF is used to infer information about the mutational processes present in the evolution of cancer. In this paper the extent of nonuniqueness is investigated for two mutational counts data, and a new sampling algorithm that can find the SFS is introduced. Our sampling algorithm is easy to implement and applies to an arbitrary rank of NMF. This is in contrast to state of the art, where the NMF rank must be smaller than or equal to four. For lower ranks we show that our algorithm performs similar to the polygon inflation algorithm that is developed in relation to chemometrics. Furthermore, we show how the size of the SFS can have a high influence on the appearing variability of a solution. Our sampling algorithm is implemented in the R package SFS (https://github.com/ragnhildlaursen/SFS).
科研通智能强力驱动
Strongly Powered by AbleSci AI