主成分分析
计算机科学
修剪
先验与后验
排列(音乐)
集合(抽象数据类型)
阈值
统计
多重比较问题
模式识别(心理学)
人工智能
数据挖掘
数学
生物
程序设计语言
农学
哲学
物理
图像(数学)
认识论
声学
作者
Brandon J. Coombes,Alexander Ploner,Sarah E. Bergen,Joanna M. Biernacka
摘要
Abstract Polygenic risk scores (PRSs) have become an increasingly popular approach for demonstrating polygenic influences on complex traits and for establishing common polygenic signals between different traits. PRSs are typically constructed using pruning and thresholding (P+T), but the best choice of parameters is uncertain; thus multiple settings are used and the best is chosen. Optimization can lead to inflated Type I error. Permutation procedures can correct this, but they can be computationally intensive. Alternatively, a single parameter setting can be chosen a priori for the PRS, but choosing suboptimal settings results in loss of power. We propose computing PRSs under a range of parameter settings, performing principal component analysis (PCA) on the resulting set of PRSs, and using the first PRS–PC in association tests. The first PC reweights the variants included in the PRS to achieve maximum variation over all PRS settings used. Using simulations and a real data application to study PRS association with bipolar disorder and psychosis in bipolar disorder, we compare the performance of the proposed PRS–PCA approach with a permutation test and an a priori selected p ‐value threshold. The PRS–PCA approach is simple to implement, outperforms the other strategies in most scenarios, and provides an unbiased estimate of prediction performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI