虚假关系
微生物群
计算机科学
分位数回归
可视化
回归
分位数
数据挖掘
批处理
计量经济学
统计
机器学习
生物
数学
生物信息学
程序设计语言
作者
Wodan Ling,Ni Zhao,Anju Lulla,Anna Plantinga,Weijia Fu,Angela Zhang,Hongjiao Liu,Zhigang Li,Jun Chen,Timothy W. Randolph,Wei Li Adeline Koay,James R. White,Lenore J. Launer,Anthony A. Fodor,Katie A. Meyer,Michael C. Wu
标识
DOI:10.1101/2021.09.23.461592
摘要
Abstract Batch effects in microbiome data arise from differential processing of specimens and can lead to spurious findings and obscure true signals. Most existing strategies for mitigating batch effects rely on approaches designed for genomic analysis, failing to address the zero-inflated and over-dispersed microbiome data. Strategies tailored for microbiome data are restricted to association testing, failing to allow other analytic goals such as visualization. We develop the Conditional Quantile Regression (ConQuR) approach to remove microbiome batch effects using a two-part quantile regression model. It is a fundamental advancement in the field because it is the first comprehensive method that accommodates the complex distributions of microbial read counts, and it generates batch-removed zero-inflated read counts that can be used in and benefit all usual subsequent analyses. We apply ConQuR to real microbiome data sets and demonstrate its state-of-the-art performance in removing batch effects while preserving or even amplifying the signals of interest.
科研通智能强力驱动
Strongly Powered by AbleSci AI