生物
计算生物学
顺序装配
基因组
遗传学
基因
转录组
基因表达
作者
Esmaeil Forouzan,Masoumeh Sadat Mousavi Maleki,Ali Asghar Karkhane,Bagher Yakhchali
标识
DOI:10.1016/j.mimet.2017.09.008
摘要
Next generation sequencing (NGS) technologies are revolutionizing biology, with Illumina being the most popular NGS platform. Short read assembly is a critical part of most genome studies using NGS. Hence, in this study, the performance of nine well-known assemblers was evaluated in the assembly of seven different microbial genomes. Effect of different read coverage and k-mer parameters on the quality of the assembly were also evaluated on both simulated and actual read datasets. Our results show that the performance of assemblers on real and simulated datasets could be significantly different, mainly because of coverage bias. According to outputs on actual read datasets, for all studied read coverages (of 7 ×, 25 × and 100 ×), SPAdes and IDBA-UD clearly outperformed other assemblers based on NGA50 and accuracy metrics. Velvet is the most conservative assembler with the lowest NGA50 and error rate.
科研通智能强力驱动
Strongly Powered by AbleSci AI