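Omics
Computer science
Frequentist inference
Bayesian probability
Data mining
Software
Machine learning
Bayesian inference
Data science
Artificial intelligence
Bioinformatics
Biology
Programming language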
Authors
Himel Mallick,Anupreet Porwal,Satabdi Saha,Piyali Basak,Vladimir Svetnik,Erina Paul
Identifier
DOI:10.1101/2022.11.06.514786
Abstract
With the growing commonality of multi-omics datasets, there is now increasing evidence that integrated omics profiles lead to more efficient discovery of clinically actionable biomarkers that enable better disease outcome prediction and patient stratification. Several methods exist to perform host phenotype prediction from cross-sectional, single-omics data modalities, but decentralized frameworks that jointly analyze multiple time-dependent omics data to highlight the integrative and dynamic impact of repeatedly measured biomarkers are currently limited. In this article, we propose a novel Bayesian ensemble method to consolidate prediction by combining information across several longitudinal and cross-sectional omics data layers. Unlike existing frequentist paradigms, our approach enables uncertainty quantification in prediction as well as interval estimation for a variety of quantities of interest based on posterior summaries. We apply our method to four published multi-omics datasets and demonstrate that it recapitulates known biology and provides novel insights, while also outperforming existing methods in estimation, prediction, and uncertainty quantification. Our open-source software is publicly available at https://github.com/himelmallick/IntegratedLearner.