Comprehensive Assessment of Genotype Imputation Performance

插补(统计学) 次等位基因频率 单核苷酸多态性 1000基因组计划 SNP公司 生物 全基因组关联研究 遗传关联 单倍型 基因型 遗传学 统计 计算生物学 缺少数据 数学 基因
作者
Shuo Shi,Yuan Ni,Ming Yang,Zhenglin Du,Jinyue Wang,Xin Sheng,Jiayan Wu,Jingfa Xiao
出处
期刊:Human Heredity [S. Karger AG]
卷期号:83 (3): 107-116 被引量:54
标识
DOI:10.1159/000489758
摘要

Genotype imputation is a process of estimating missing ge-notypes from the haplotype or genotype reference panel. It can effectively boost the power of detecting single nucleotide polymorphisms (SNPs) in genome-wide association studies, integrate multi-studies for meta-analysis, and be applied in fine-mapping studies. The performance of genotype imputation is affected by many factors, including software, reference selection, sample size, and SNP density/sequencing coverage. A systematical evaluation of the imputation performance of current popular software will benefit future studies. Here, we evaluate imputation performances of Beagle4.1, IMPUTE2, MACH+Minimac3, and SHAPEIT2+ IM-PUTE2 using test samples of East Asian ancestry and references of the 1000 Genomes Project. The result indicated the accuracy of IMPUTE2 (99.18%) is slightly higher than that of the others (Beagle4.1: 98.94%, MACH+Minimac3: 98.51%, and SHAPEIT2+IMPUTE2: 99.08%). To achieve good and stable imputation quality, the minimum requirement of SNP density needs to be > 200/Mb. The imputation accuracies of IMPUTE2 and Beagle4.1 were under the minor influence of the study sample size. The contribution extent of reference to genotype imputation performance relied on software selection. We assessed the imputation performance on SNPs generated by next-generation whole genome sequencing and found that SNP sets detected by sequencing with 15× depth could be mostly got by imputing from the haplotype reference panel of the 1000 Genomes Project based on SNP data detected by sequencing with 4× depth. All of the imputation software had a weaker performance in low minor allele frequency SNP regions because of the bias of reference or software. In the future, more comprehensive reference panels or new algorithm developments may rise up to this challenge.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Ava应助octopus采纳,获得10
刚刚
钩钩颂发布了新的文献求助10
1秒前
赘婿应助义气的静槐采纳,获得10
1秒前
沙寂月影发布了新的文献求助10
1秒前
FashionBoy应助twk采纳,获得10
2秒前
李嘉发布了新的文献求助10
2秒前
田様应助chake采纳,获得10
3秒前
hhy完成签到 ,获得积分10
3秒前
宋瓜发布了新的文献求助10
4秒前
江泽应助白冰蓝采纳,获得20
5秒前
斯文念波完成签到,获得积分20
5秒前
5秒前
7秒前
斯文念波发布了新的文献求助30
7秒前
Janx完成签到,获得积分10
8秒前
心无杂念完成签到 ,获得积分10
10秒前
白白拜拜完成签到,获得积分10
10秒前
慕青应助李嘉采纳,获得10
11秒前
NANA1216发布了新的文献求助10
11秒前
13秒前
研友_ZAx4Gn完成签到,获得积分10
14秒前
酷波er应助拉拉采纳,获得10
15秒前
搜集达人应助Huang采纳,获得10
15秒前
16秒前
19秒前
宋瓜完成签到,获得积分10
19秒前
happyboy2008发布了新的文献求助10
20秒前
RobiN完成签到,获得积分0
20秒前
NANA1216完成签到,获得积分10
22秒前
鲤鱼忆曼发布了新的文献求助10
22秒前
23秒前
科研通AI2S应助fqf采纳,获得10
23秒前
23秒前
Tsang发布了新的文献求助10
27秒前
领导范儿应助小贤采纳,获得10
28秒前
Huang发布了新的文献求助10
28秒前
Beebee24完成签到,获得积分10
30秒前
柳叶刀完成签到 ,获得积分10
32秒前
Phy应助mingge采纳,获得10
32秒前
在水一方应助甜甜盼夏采纳,获得10
33秒前
高分求助中
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 1000
Corrosion and Oxygen Control 600
Yaws' Handbook of Antoine coefficients for vapor pressure 500
Python Programming for Linguistics and Digital Humanities: Applications for Text-Focused Fields 500
行動データの計算論モデリング 強化学習モデルを例として 500
Johann Gottlieb Fichte: Die späten wissenschaftlichen Vorlesungen / IV,1: ›Transzendentale Logik I (1812)‹ 400
The role of families in providing long term care to the frail and chronically ill elderly living in the community 380
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2555762
求助须知:如何正确求助?哪些是违规求助? 2179804
关于积分的说明 5621458
捐赠科研通 1901159
什么是DOI,文献DOI怎么找? 949631
版权声明 565592
科研通“疑难数据库(出版商)”最低求助积分说明 504754