基因组
顺序装配
计算生物学
拟南芥
纳米孔测序
计算机科学
正确性
生物
遗传学
基因
程序设计语言
基因表达
突变体
转录组
作者
Lia Obinu,Urmi Trivedi,Andrea Porceddu
标识
DOI:10.3389/fbinf.2024.1462923
摘要
The implementation of Hi-C reads in the de novo genome assembly process allows the ordering of large regions of the genome in scaffolds and the generation of chromosome-level assemblies. Several bioinformatics tools have been developed for genome scaffolding with Hi-C, and each tool has advantages and disadvantages that need to be carefully evaluated before their adoption. We generated two de novo assemblies of Arabidopsis thaliana obtained from the same raw PacBio HiFi and Oxford Nanopore Technologies data. We scaffolded the assemblies implementing Hi-C reads with the scaffolders 3D-DNA, SALSA2, and YaHS, with the aim of identifying the tool providing the most accurate assembly. The scaffolded assemblies were evaluated according to contiguity, completeness, accuracy, and structural correctness. In our analysis, YaHS proved to be the best-performing bioinformatics tool for scaffolding de novo genome assemblies in Arabidopsis thaliana.
科研通智能强力驱动
Strongly Powered by AbleSci AI