计算机科学
嵌入
聚腺苷酸
代表(政治)
人工智能
深度学习
信号(编程语言)
模式识别(心理学)
机器学习
算法
理论计算机科学
化学
政治
基因
核糖核酸
生物化学
程序设计语言
法学
政治学
作者
Yanbu Guo,Hongxue Shen,Weihua Li,Chaoyang Li,Han Zhang
标识
DOI:10.1016/j.knosys.2022.109887
摘要
The polyadenylation process begins as the transcription of a gene terminates, and polyadenylation signal prediction by the genomic sequence is the key to understanding the mechanism of mRNA metabolism. Computational algorithms are widely proposed for polyadenylation signal prediction, but they mainly rely on sequences’ statistical and compositional characteristics. In this work, we conduct adaptively context-aware feature learning for polyadenylation signal prediction via a deep model and co-occurrence embedding. Specifically, we devise an activation function with dynamic parameters to control the scale of features channel-wisely for the positive and negative part value of neuron outputs, and then use the statistical information of k -mer occurrences as features of polyadenylation data, which is pre-trained by an unsupervised learning algorithm. Next, by combining multiscale convolutions with adaptive coefficients, long short-term memory networks jointly extract spatial and temporal contextual patterns. The patterns are fused to obtain contextual information by an identity skip connection and addition strategy. Experimental results demonstrate that our proposed model achieves more remarkable performance than state-of-the-art methods across polyadenylation signal benchmarks, and an ablation study shows the effectiveness of the model, and k -mer embedding is a good feature representation for polyadenylation signals.
科研通智能强力驱动
Strongly Powered by AbleSci AI