复制(统计)
连锁不平衡
遗传关联
全基因组关联研究
荟萃分析
插补(统计学)
统计能力
计算机科学
联动装置(软件)
计算生物学
生物
遗传学
缺少数据
单核苷酸多态性
统计
医学
机器学习
数学
基因
基因型
病毒学
内科学
作者
Frank Dudbridge,Paul J. Newcombe
标识
DOI:10.1002/9781119487845.ch22
摘要
Replication and meta-analysis are routinely conducted when following up genome-wide association studies. Replication is a mandatory step in confirming the validity of associations. Technical replication verifies the integrity of the original data, while direct replication reproduces the association as closely as possible in independent data. Indirect replication, while not providing full support for the original association, provides evidence for its generalisability. A statistical effect that bears on the power of replication studies is the winner's curse, whereby the effect size seen in a discovery study tends to be more extreme than in truth. Several methods are available to correct for the winner's curse, depending on how associations are first identified and whether replication data are already available. Meta-analysis, in which results from several studies are combined, is increasingly important as individual studies have limited power to detect genetic associations. Although standard fixed and random effects models can be used, more powerful methods are available for detecting effects when there is heterogeneity across studies. Heterogeneity may arise from variation in linkage disequilibrium patterns, case ascertainment and interaction effects. Imputation allows the combination of studies that have genotyped different markers. Increasingly, consortia are being formed in advance of initial data analysis, allowing harmonisation of quality control criteria and sharing of individual-level data.
科研通智能强力驱动
Strongly Powered by AbleSci AI