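Proteome
Set (abstract data type)
Identification (biology)
Computer science
Yeast
Data set
Transfer (computing)
False discovery rate
Sample (material)
Algorithm
Computational biology
Data mining
Biology
Chemistry
Bioinformatics
Chromatography
Artificial intelligence
Biochemistry
Parallel computing
Programming language
Gene
Plant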
Authors
Matthew Lim, João A. Paulo, Steven P. Gygi
Identifier
DOI:10.1021/acs.jproteome.9b00492
Abstract
Stochasticity between independent LC-MS/MS runs is a challenging problem in the field of proteomics, resulting in significant missing values (i.e., abundance measurements) among observed peptides. To address this issue, several approaches have been developed including computational methods such as MaxQuant's match-between-runs (MBR) algorithm. Often dozens of runs are all considered at once by MBR, transferring identifications from any one run to any of the others. To evaluate the error associated with these transfer events, we created a two-sample/two-proteome approach. In this way, samples containing no yeast lysate (n = 20) were assessed for false identification transfers from samples containing yeast (n = 20). While MBR increased the total number of spectral identifications by ∼40%, we also found that 44% of all identified yeast proteins had identifications transferred to at least one sample without yeast. However, of these only 2.7% remained in the final data set after applying the MaxQuant LFQ algorithm. We conclude that false transfers by MBR are plentiful, but few are retained in the final data set.
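The headline metric in the abstract (the share of identified yeast proteins that had an identification transferred into at least one yeast-free sample) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `Identification` record, its field names, the organism labels, the sample names, and the toy data do not come from the paper or from MaxQuant's output format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Identification:
    protein: str        # protein identifier (hypothetical)
    organism: str       # "yeast" or "other" (hypothetical labels)
    sample: str         # sample name (hypothetical)
    via_transfer: bool  # True if the identification came from an MBR transfer


def false_transfer_fraction(identifications, yeast_free_samples):
    """Fraction of identified yeast proteins with at least one
    transferred identification in a sample that contains no yeast."""
    yeast_proteins = {
        i.protein for i in identifications if i.organism == "yeast"
    }
    falsely_transferred = {
        i.protein
        for i in identifications
        if i.organism == "yeast"
        and i.via_transfer
        and i.sample in yeast_free_samples
    }
    if not yeast_proteins:
        return 0.0
    return len(falsely_transferred) / len(yeast_proteins)


# Toy data (invented): A1 contains yeast, B1 and B2 do not.
records = [
    Identification("Y1", "yeast", "A1", False),
    Identification("Y2", "yeast", "A1", False),
    Identification("Y3", "yeast", "A1", False),
    Identification("Y4", "yeast", "A1", False),
    Identification("Y1", "yeast", "B1", True),
    Identification("Y2", "yeast", "B1", True),
    Identification("Y2", "yeast", "B2", True),  # same protein, counted once
    Identification("H1", "other", "B1", False),
    Identification("H1", "other", "A1", True),  # not a yeast protein
]

fraction = false_transfer_fraction(records, yeast_free_samples={"B1", "B2"})
print(f"{fraction:.0%} of yeast proteins had a false transfer")
```

Counting at the protein level, rather than per transfer event, mirrors how the abstract states its 44% figure. Applying the same function to identifications that survive downstream filtering would give the analogue of the 2.7% retained after the LFQ step.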