生物
基因组
基因分型
计算生物学
后转座子
人口
遗传学
系统发育树
注释
参考基因组
结构变异
基因组学
进化生物学
转座因子
基因
基因型
人口学
社会学
作者
Ming Hu,Penglong Wan,Chengjie Chen,Shihua Tang,Jiahao Chen,Lin‐Fa Wang,Mahul Chakraborty,Yongfeng Zhou,Jinfeng Chen,Brandon S. Gaut,J. J. Emerson,Yi Liao
标识
DOI:10.1093/molbev/msaf180
摘要
Comparisons of complete genome assemblies offer a direct procedure for characterizing all genetic differences among them. However, existing tools are often limited to specific aligners or optimized for specific organisms, narrowing their applicability, particularly for large and repetitive plant genomes. Here, we introduce SVGAP, a pipeline for structural variant (SV) discovery, genotyping, and annotation from high-quality genome assemblies at the population level. Through extensive benchmarks using simulated SV datasets at individual, population, and phylogenetic contexts, we demonstrate that SVGAP performs favorably relative to existing tools in SV discovery. Additionally, SVGAP is one of the few tools to address the challenge of genotyping SVs within large assembled genome samples, and it generates fully genotyped VCF files. Applying SVGAP to 26 maize genomes revealed hidden genomic diversity in centromeres, driven by abundant insertions of centromere-specific LTR-retrotransposons. The output of SVGAP is well-suited for pan-genome construction and facilitates the interpretation of previously unexplored genomic regions.
科研通智能强力驱动
Strongly Powered by AbleSci AI