插补(统计学)
生物
基因分型
遗传学
连锁不平衡
基因组
单核苷酸多态性
人口
计算生物学
SNP公司
SNP基因分型
全基因组关联研究
核苷酸多型性
基因组学
全基因组测序
遗传关联
基因型
统计
缺少数据
基因
数学
社会学
人口学
作者
Sven E. Weber,Lennard Roscher-Ehrig,Tobias Kox,Amine Abbadi,Andreas Stahl,Rod J. Snowdon
出处
期刊:Genome
[NRC Research Press]
日期:2024-05-06
卷期号:67 (7): 210-222
标识
DOI:10.1139/gen-2023-0126
摘要
Advances in sequencing technology allow whole plant genomes to be sequenced with high quality. Combining genotypic and phenotypic data in genomic prediction helps breeders to select crossing partners in partially phenotyped populations. In plant breeding programs, the cost of sequencing entire breeding populations still exceeds available genotyping budgets. Hence, the method for genotyping is still mainly single nucleotide polymorphism (SNP) arrays; however, arrays are unable to assess the entire genome- and population-wide diversity. A compromise involves genotyping the entire population using an SNP array and a subset of the population with whole-genome sequencing. Both datasets can then be used to impute markers from whole-genome sequencing onto the entire population. Here, we evaluate whether imputation of whole-genome sequencing data enhances genomic predictions, using data from a nested association mapping population of rapeseed ( Brassica napus). Employing two cross-validation schemes that mimic scenarios for the prediction of close and distant relatives, we show that imputed marker data do not significantly improve prediction accuracy, likely due to redundancy in relationship estimates and imputation errors. In simulation studies, only small improvements were observed, further corroborating the findings. We conclude that SNP arrays are already equipped with the information that is added by imputation through relationship and linkage disequilibrium.
科研通智能强力驱动
Strongly Powered by AbleSci AI