Comprehensive Assessment of Genotype Imputation Performance

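Imputation (statistics), Minor allele frequency, Single-nucleotide polymorphism, 1000 Genomes Project, SNP, Biology, Genome-wide association study, Genetic association, Haplotype, Genotype, Genetics, Statistics, Computational biology, Missing data, Mathematics, Gene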
Authors
Shuo Shi,Na Yuan,Ming Yang,Zhenglin Du,Jinyue Wang,Xin Sheng,Jiayan Wu,Jingfa Xiao
Source
Journal: Human Heredity [Karger Publishers]
Volume (issue): 83 (3): 107-116 Citations: 64
Identifier
DOI:10.1159/000489758
Abstract

Genotype imputation is the process of estimating missing genotypes from a haplotype or genotype reference panel. It can boost the power to detect single nucleotide polymorphisms (SNPs) in genome-wide association studies, integrate multiple studies for meta-analysis, and support fine-mapping studies. The performance of genotype imputation is affected by many factors, including software, reference selection, sample size, and SNP density/sequencing coverage. A systematic evaluation of the imputation performance of currently popular software will benefit future studies. Here, we evaluate the imputation performance of Beagle4.1, IMPUTE2, MACH+Minimac3, and SHAPEIT2+IMPUTE2 using test samples of East Asian ancestry and references from the 1000 Genomes Project. The results indicate that the accuracy of IMPUTE2 (99.18%) is slightly higher than that of the others (Beagle4.1: 98.94%, MACH+Minimac3: 98.51%, and SHAPEIT2+IMPUTE2: 99.08%). To achieve good and stable imputation quality, SNP density needs to exceed 200/Mb. The imputation accuracies of IMPUTE2 and Beagle4.1 were only slightly influenced by the study sample size. The extent to which the reference contributed to genotype imputation performance depended on the software selected. We assessed the imputation performance on SNPs generated by next-generation whole genome sequencing and found that SNP sets detected by sequencing at 15× depth could mostly be recovered by imputing from the haplotype reference panel of the 1000 Genomes Project, based on SNP data detected by sequencing at 4× depth. All of the imputation software performed worse in low minor allele frequency SNP regions because of bias in the reference or the software. In the future, more comprehensive reference panels or new algorithm developments may rise to this challenge.
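
The abstract reports accuracy as a percentage and gives a SNP density threshold of 200/Mb without defining either calculation. The sketch below shows one common way such figures could be computed. It assumes accuracy means genotype concordance, that is, the fraction of masked genotypes whose imputed call matches the known call; the paper may use a different measure. The function names `concordance` and `snp_density_per_mb` and all data values are hypothetical illustrations, not taken from the paper.

```python
def concordance(true_genotypes, imputed_genotypes):
    """Fraction of genotypes whose imputed call equals the known call.

    Assumption: genotypes are coded as alternate-allele counts (0, 1, 2).
    """
    if len(true_genotypes) != len(imputed_genotypes):
        raise ValueError("genotype lists must have the same length")
    if not true_genotypes:
        raise ValueError("genotype lists must not be empty")
    matches = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return matches / len(true_genotypes)


def snp_density_per_mb(n_snps, region_length_bp):
    """Number of genotyped SNPs per megabase of the region."""
    if region_length_bp <= 0:
        raise ValueError("region length must be positive")
    return n_snps / (region_length_bp / 1_000_000)


# Toy data, invented for illustration only.
truth = [0, 1, 2, 0, 1, 0, 2, 1]
imputed = [0, 1, 2, 0, 0, 0, 2, 1]  # one call differs from the truth

acc = concordance(truth, imputed)
density = snp_density_per_mb(25_000, 100_000_000)
meets_threshold = density > 200  # threshold stated in the abstract

print(f"concordance: {acc:.2%}")
print(f"SNP density: {density:.1f}/Mb, above 200/Mb: {meets_threshold}")
```

In practice, concordance is often stratified by minor allele frequency, since the abstract notes that all tools performed worse on low-frequency SNPs.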
