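Contiguity
Sequence assembly
Computer science
Sequence (biology)
Graph
Haplotype
Computational biology
Feature (linguistics)
Biology
Algorithm
Theoretical computer science
Sequence motif
Consensus sequence
Encoding
Sequence alignment
Directed graph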
Authors
Haoyu Cheng,Gregory T. Concepcion,Xiaowen Feng,Haowen Zhang,Heng Li
Source
Journal: Nature Methods
[Nature Portfolio]
Date: 2021-02-01
Volume (issue): 18 (2): 170-175
Citations: 5257
Identifier
DOI:10.1038/s41592-020-01056-5
Abstract
Haplotype-resolved de novo assembly is the ultimate solution to the study of sequence variations in a genome. However, existing algorithms either collapse heterozygous alleles into one consensus copy or fail to cleanly separate the haplotypes to produce high-quality phased assemblies. Here we describe hifiasm, a new de novo assembler that takes advantage of long high-fidelity sequence reads to faithfully represent the haplotype information in a phased assembly graph. Unlike other graph-based assemblers that only aim to maintain the contiguity of one haplotype, hifiasm strives to preserve the contiguity of all haplotypes. This feature enables the development of a graph trio binning algorithm that greatly advances over standard trio binning. On three human and five non-human datasets, including California redwood with a ~30-gigabase hexaploid genome, we show that hifiasm frequently delivers better assemblies than existing tools and consistently outperforms others on haplotype-resolved assembly.