iPromoter-CLA: Identifying promoters and their strength by deep capsule networks with bidirectional long short-term memory

发起人卷积神经网络计算生物学计算机科学人工神经网络鉴定（生物学）上游（联网）图层（电子） DNA测序人工智能 DNA 模式识别（心理学）基因生物遗传学基因表达电信纳米技术植物材料科学

作者

Zhimin Zhang,Jianping Zhao,Pi-Jing Wei,Chun-Hou Zheng

出处

期刊：Computer Methods and Programs in Biomedicine [Elsevier BV]
日期：2022-08-28 卷期号：226: 107087-107087 被引量：12

链接

nih.govdoi.org

标识

DOI：10.1016/j.cmpb.2022.107087

摘要

• A new two-layer promoter predictor called iPro2L-CLA to identify promoters and their strength. • In this study, we firstly proposed a new capsule network and recurrent neural network hybrid model to identify promoters and predict their strength. • Our model attains a cross-validation accuracy of 86% and 73.46% in prokaryotic promoter recognition and their strength prediction. The promoter is a fragment of DNA and a specific sequence with transcriptional regulation function in DNA. Promoters are located upstream at the transcription start site, which is used to initiate downstream gene expression. So far, promoter identification is mainly achieved by biological methods, which often require more effort. It has become a more effective classification and prediction method to identify promoter types through computational methods. In this study, we proposed a new capsule network and recurrent neural network hybrid model to identify promoters and predict their strength. Firstly, we used one-hot to encode DNA sequence. Secondly, we used three one-dimensional convolutional layers, a one-dimensional convolutional capsule layer and digit capsule layer to learn local features. Thirdly, a bidirectional long short-time memory was utilized to extract global features. Finally, we adopted the self-attention mechanism to improve the contribution of relatively important features, which further enhances the performance of the model. Our model attains a cross-validation accuracy of 86% and 73.46% in prokaryotic promoter recognition and their strength prediction, which showcases a better performance compared with the existing approaches in both the first layer promoter identification and the second layer promoter's strength prediction. our model not only combines convolutional neural network and capsule layer but also uses a self-attention mechanism to better capture hidden information features from the perspective of sequence. Thus, we hope that our model can be widely applied to other components.

求助该文献

最长约 10秒，即可获得该文献文件

iPromoter-CLA: Identifying promoters and their strength by deep capsule networks with bidirectional long short-term memory

今日热心研友