堆积
计算生物学
核糖核酸
计算机科学
N6-甲基腺苷
生物
编码器
人工智能
DNA
遗传学
化学
基因
甲基化
有机化学
甲基转移酶
操作系统
作者
Qianyue Li,Xin Cheng,Chen Song,Taigang Liu
出处
期刊:Symmetry
[Multidisciplinary Digital Publishing Institute]
日期:2023-03-15
卷期号:15 (3): 731-731
被引量:12
摘要
As the most abundant RNA methylation modification, N6-methyladenosine (m6A) could regulate asymmetric and symmetric division of hematopoietic stem cells and play an important role in various diseases. Therefore, the precise identification of m6A sites around the genomes of different species is a critical step to further revealing their biological functions and influence on these diseases. However, the traditional wet-lab experimental methods for identifying m6A sites are often laborious and expensive. In this study, we proposed an ensemble deep learning model called m6A-BERT-Stacking, a powerful predictor for the detection of m6A sites in various tissues of three species. First, we utilized two encoding methods, i.e., di ribonucleotide index of RNA (DiNUCindex_RNA) and k-mer word segmentation, to extract RNA sequence features. Second, two encoding matrices together with the original sequences were respectively input into three different deep learning models in parallel to train three sub-models, namely residual networks with convolutional block attention module (Resnet-CBAM), bidirectional long short-term memory with attention (BiLSTM-Attention), and pre-trained bidirectional encoder representations from transformers model for DNA-language (DNABERT). Finally, the outputs of all sub-models were ensembled based on the stacking strategy to obtain the final prediction of m6A sites through the fully connected layer. The experimental results demonstrated that m6A-BERT-Stacking outperformed most of the existing methods based on the same independent datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI