Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy

基因组选择选择（遗传算法）主成分分析人口生物聚类分析集合（抽象数据类型）统计遗传学计算机科学数学人工智能基因型单核苷酸多态性医学程序设计语言环境卫生基因

作者

Adam Norman,Julian Taylor,James Edwards,Haydn Kuchel

出处

期刊：G3: Genes, Genomes, Genetics [Genetics Society of America]
日期：2018-09-01 卷期号：8 (9): 2889-2899 被引量：126

链接

g3journal.org g3journal.org doaj.org europepmc.org europepmc.org handle.net edu.au nih.gov nih.govdoi.org

标识

DOI：10.1534/g3.118.200311

摘要

Genomic selection applied to plant breeding enables earlier estimates of a line's performance and significant reductions in generation interval. Several factors affecting prediction accuracy should be well understood if breeders are to harness genomic selection to its full potential. We used a panel of 10,375 bread wheat (Triticum aestivum) lines genotyped with 18,101 SNP markers to investigate the effect and interaction of training set size, population structure and marker density on genomic prediction accuracy. Through assessing the effect of training set size we showed the rate at which prediction accuracy increases is slower beyond approximately 2,000 lines. The structure of the panel was assessed via principal component analysis and K-means clustering, and its effect on prediction accuracy was examined through a novel cross-validation analysis according to the K-means clusters and breeding cohorts. Here we showed that accuracy can be improved by increasing the diversity within the training set, particularly when relatedness between training and validation sets is low. The breeding cohort analysis revealed that traits with higher selection pressure (lower allelic diversity) can be more accurately predicted by including several previous cohorts in the training set. The effect of marker density and its interaction with population structure was assessed for marker subsets containing between 100 and 17,181 markers. This analysis showed that response to increased marker density is largest when using a diverse training set to predict between poorly related material. These findings represent a significant resource for plant breeders and contribute to the collective knowledge on the optimal structure of calibration panels for genomic prediction.

求助该文献

Optimising Genomic Selection in Wheat: Effect of Marker Density, Population Size and Population Structure on Prediction Accuracy

今日热心研友