主成分分析
降维
孟德尔随机化
统计
多重共线性
数学
回归
模式识别(心理学)
计算机科学
维数之咒
贝叶斯定理
人工智能
回归分析
贝叶斯概率
生物
基因型
基因
生物化学
遗传变异
作者
Vasilios Karageorgiou,Dipender Gill,Jack Bowden,Verena Zuber
出处
期刊:eLife
[eLife Sciences Publications Ltd]
日期:2023-04-19
卷期号:12
被引量:12
摘要
Multivariable Mendelian randomisation (MVMR) is an instrumental variable technique that generalises the MR framework for multiple exposures. Framed as a regression problem, it is subject to the pitfall of multicollinearity. The bias and efficiency of MVMR estimates thus depends heavily on the correlation of exposures. Dimensionality reduction techniques such as principal component analysis (PCA) provide transformations of all the included variables that are effectively uncorrelated. We propose the use of sparse PCA (sPCA) algorithms that create principal components of subsets of the exposures with the aim of providing more interpretable and reliable MR estimates. The approach consists of three steps. We first apply a sparse dimension reduction method and transform the variant-exposure summary statistics to principal components. We then choose a subset of the principal components based on data-driven cutoffs, and estimate their strength as instruments with an adjusted F -statistic. Finally, we perform MR with these transformed exposures. This pipeline is demonstrated in a simulation study of highly correlated exposures and an applied example using summary data from a genome-wide association study of 97 highly correlated lipid metabolites. As a positive control, we tested the causal associations of the transformed exposures on coronary heart disease (CHD). Compared to the conventional inverse-variance weighted MVMR method and a weak instrument robust MVMR method (MR GRAPPLE), sparse component analysis achieved a superior balance of sparsity and biologically insightful grouping of the lipid traits.
科研通智能强力驱动
Strongly Powered by AbleSci AI