假基因
计算生物学
基因复制
基因
遗传学
生物
序列(生物学)
基因组
等位基因
节段重复
代表(政治)
水准点(测量)
基因转化
基因组学
拷贝数变化
序列分析
计算机科学
人类基因组
序列比对
作者
Hongyu Ding,Fritz J. Sedlazeck,Christos Proukakis,Caoimhe Morley,Marco Toffoli,Anthony H.V. Schapira,Zhirui Liao,Lianrong Pu,Shanfeng Zhu
标识
DOI:10.1038/s41467-025-67096-7
摘要
Sequence alignment is essential for genomic research and clinical diagnostics, yet detecting complex rearrangements such as inversions, duplications, and gene conversions remains challenging due to allele complexity and limitations of current methods. We introduce VACmap, a non-linear mapping approach to enhance the detection and representation of all genetic variations. VACmap improves duplication detection from 20% to 90% in the Challenging Medically-Relevant Genes (CMRG) benchmark and improves characterization of complex inversions in repetitive regions and gene conversion events. It improves resolving clinically significant loci, including the LPA gene (with repetitive KIV-2 units linked to coronary heart disease), GBA1 and STRC genes (risk factors for Parkinson's disease and hearing loss, respectively, affected by pseudogene recombination with GBAP1 and STRCP1). Here, we show that VACmap delivers better alignment accuracy and SV detection, providing a robust tool for genomic analysis and clinical insights, with potential to advance understanding of genetic diversity and disease mechanisms.
科研通智能强力驱动
Strongly Powered by AbleSci AI