计算机科学
管道(软件)
DNA测序
吞吐量
变压器
字错误率
人工智能
数据挖掘
基因
无线
生物
电信
工程类
遗传学
电气工程
电压
程序设计语言
作者
Shuwei Li,Zhiru Guo,Ao Shen,Zheqi Yu,Wei Mao,Shaobo Luo,Hao Yu
标识
DOI:10.1109/embc48229.2022.9871730
摘要
Gene sequencing technology is a tool which greatly impacts modern biology and medicine. The next-generation sequencing (NGS) lies at the heart of gene sequencing for its massively increasing throughput, but it is difficult to analyze the large quantities of fluorescent images with high accuracy because the fluorescent signals are weak with varying noise signals, and current designs are limited on accuracy and speed. In this paper, we proposed a novel deep learning based gene sequencing pipeline with semi-automatic labelling method. The obtained results are promising, especially on the high-density data, as the BaseFormer surpasses the traditional methods in terms of cluster quality (Q30: 88 %), throughput (16.5% better), and with similar and low error rate (down to 0.137% on average, best at 0.068 % on high-density data).
科研通智能强力驱动
Strongly Powered by AbleSci AI