全基因组关联研究
遗传关联
人口结构
联想(心理学)
统计能力
计算生物学
进化生物学
人口
生物
计算机科学
遗传学
统计
单核苷酸多态性
数学
人口学
心理学
基因型
基因
社会学
心理治疗师
作者
Julie-Alexia Dias,Tony Chen,Xing Hua,Xiaoyu Wang,Alex A. Rodríguez,Ravi Madduri,Peter Kraft,Haoyu Zhang
出处
期刊:Cold Spring Harbor Laboratory - medRxiv
日期:2025-03-12
标识
DOI:10.1101/2025.03.11.25323772
摘要
Abstract The increasing availability of diverse biobanks has enabled multi-ancestry genome-wide association studies (GWAS), enhancing the discovery of genetic variants across traits and diseases. However, the choice of an optimal method remains debated due to challenges in statistical power differences across ancestral groups and approaches to account for population structure. Two primary strategies exist: (1) Pooled analysis, which combines individuals from all genetic backgrounds into a single dataset while adjusting for population stratification using principal components, increasing the sample size and statistical power but requiring careful control of population stratification. (2) Meta-analysis, which performs ancestry-group-specific GWAS and subsequently combines summary statistics, potentially capturing fine-scale population structure, but facing limitations in handling admixed individuals. Using large-scale simulations with varying sample sizes and ancestry compositions, we compare these methods alongside real data analyses of eight continuous and five binary traits from the UK Biobank (N≈324,000) and All of Us Research Program (N≈207,000). Our results demonstrate that pooled analysis generally exhibits better statistical power while effectively adjusting for population stratification. We further present a theoretical framework linking power differences to allele frequency variations across populations. These findings, validated across both biobanks, highlight pooled analysis as a robust and scalable strategy for multi-ancestry GWAS, improving genetic discovery while maintaining rigorous population structure control.
科研通智能强力驱动
Strongly Powered by AbleSci AI