德布鲁恩序列
德布鲁因图
组合数学
计算机科学
二进制对数
图形
离散数学
常量(计算机编程)
数据结构
序列(生物学)
数学
遗传学
生物
程序设计语言
作者
Alexander Bowe,Taku Onodera,Kunihiko Sadakane,Tetsuo Shibuya
标识
DOI:10.1007/978-3-642-33122-0_18
摘要
We propose a new succinct de Bruijn graph representation. If the de Bruijn graph of k-mers in a DNA sequence of length N has m edges, it can be represented in 4m + o(m) bits. This is much smaller than existing ones. The numbers of outgoing and incoming edges of a node are computed in constant time, and the outgoing and incoming edge with given label are found in constant time and $\mathcal{O}(k)$ time, respectively. The data structure is constructed in $\mathcal{O}(Nk \log m/\log\log m)$ time using no additional space.
科研通智能强力驱动
Strongly Powered by AbleSci AI