错误检测和纠正
DNA测序
冗余(工程)
计算生物学
计算机科学
字错误率
DNA
杂交测序
深度测序
杂交基因组组装
DNA纳米球测序
算法
DNA测序器
遗传学
生物
霰弹枪测序
基因组文库
基序列
基因组
基因
人工智能
操作系统
作者
Zitian Chen,Wenxiong Zhou,Shuo Qiao,Kang Li,Haifeng Duan,Xiaohui Xie,Yanyi Huang
摘要
Eliminating errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly improve sequencing accuracy by combining fluorogenic sequencing-by-synthesis (SBS) with an information theory-based error-correction algorithm. ECC embeds redundancy in sequencing reads by creating three orthogonal degenerate sequences, generated by alternate dual-base reactions. This is similar to encoding and decoding strategies that have proved effective in detecting and correcting errors in information communication and storage. We show that, when combined with a fluorogenic SBS chemistry with raw accuracy of 98.1%, ECC sequencing provides single-end, error-free sequences up to 200 bp. ECC approaches should enable accurate identification of extremely rare genomic variations in various applications in biology and medicine.
科研通智能强力驱动
Strongly Powered by AbleSci AI