计算机科学
变压器
数据压缩
压缩比
编码器
解码方法
压缩(物理)
数据挖掘
人工智能
机器学习
算法
电压
工程类
材料科学
电气工程
复合材料
内燃机
汽车工程
操作系统
作者
Zhanbei Cui,Tongda Xu,Jia Wang,Yu Liao,Yan Wang
标识
DOI:10.1109/icassp48485.2024.10448360
摘要
The development of gene sequencing technology sparks an explosive growth of gene data. Thus, the storage of gene data has become an important issue. Recently, researchers begin to investigate deep learning-based gene data compression, which outperforms general traditional methods. In this paper, we propose a transformer-based gene compression method named GeneFormer. Specifically, we first introduce a modified transformer encoder with latent array to eliminate the dependency of the nucleotide sequence. Then, we design a multi-level-grouping method to accelerate and improve the compression process. Experimental results on real-world datasets show that our method achieves significantly better compression ratio compared with state-of-the-art method, and the decoding speed is significantly faster than all existing learning-based gene compression methods. We will release our code on github once the paper is accepted.
科研通智能强力驱动
Strongly Powered by AbleSci AI