计算机科学
蛋白质组学
插补(统计学)
算法
数据挖掘
机器学习
缺少数据
化学
生物化学
基因
作者
Jessica Conforti,Constantine C. Breus,Elyssia S. Gallagher
标识
DOI:10.1021/acs.jproteome.5c00429
摘要
Comparative proteomics experiments reveal biomarkers by using statistical tests to determine proteins expressed with a higher abundance in one sample versus another. However, comparative experiments can be complicated by variability in all aspects of proteomics workflows. To account for variability, software for database searching contains retention-time alignment and imputation algorithms to correct for retention-time shifts and assign abundances to missing proteins. While these algorithms improve quantification and reduce processing time, we hypothesize that they alter statistical comparisons between samples when samples are searched together. Herein, we search the data for different cleanup methods or single proteins either separately or together in Progenesis Qi for proteomics database searching software. Our results show that searching samples together increases the number of identifications in each sample, enhances the protein similarity between samples, and leads to false transfers. Furthermore, we demonstrate that searching samples together affects protein abundance, differentially expressed proteins, and confidence scores due to retention-time alignment and imputation algorithms. Ultimately, we highlight that careful consideration of the search design is necessary to determine biomarkers in comparative proteomics experiments. Search results from the reanalyzed data set comparing sample-cleanup methods (MSV000094130) and single-protein data have been deposited into MassIVE (MSV000096112) and ProteomeXchange (PXD056868).
科研通智能强力驱动
Strongly Powered by AbleSci AI