Prediction of Multiple Types of RNA Modifications via Biological Language Model

计算机科学 计算生物学 人工智能 核糖核酸 生物 遗传学 基因
作者
Ying Zhang,Fang Ge,Fuyi Li,Xibei Yang,Jiangning Song,Dong‐Jun Yu
出处
期刊:IEEE/ACM Transactions on Computational Biology and Bioinformatics [Institute of Electrical and Electronics Engineers]
卷期号:20 (5): 3205-3214 被引量:12
标识
DOI:10.1109/tcbb.2023.3283985
摘要

It has been demonstrated that RNA modifications play essential roles in multiple biological processes. Accurate identification of RNA modifications in the transcriptome is critical for providing insights into the biological functions and mechanisms. Many tools have been developed for predicting RNA modifications at single-base resolution, which employ conventional feature engineering methods that focus on feature design and feature selection processes that require extensive biological expertise and may introduce redundant information. With the rapid development of artificial intelligence technologies, end-to-end methods are favorably received by researchers. Nevertheless, each well-trained model is only suitable for a specific RNA methylation modification type for nearly all of these approaches. In this study, we present MRM-BERT by feeding task-specific sequences into the powerful BERT (Bidirectional Encoder Representations from Transformers) model and implementing fine-tuning, which exhibits competitive performance to the state-of-the-art methods. MRM-BERT avoids repeated de novo training of the model and can predict multiple RNA modifications such as pseudouridine, m6A, m5C, and m1A in Mus musculus , Arabidopsis thaliana , and Saccharomyces cerevisiae . In addition, we analyse the attention heads to provide high attention regions for the prediction, and conduct saturated in silico mutagenesis of the input sequences to discover potential changes of RNA modifications, which can better assist researchers in their follow-up research.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
积极的随阴完成签到,获得积分10
刚刚
Ran发布了新的文献求助10
2秒前
英勇的数据线完成签到,获得积分20
2秒前
温柔以冬完成签到 ,获得积分10
2秒前
3秒前
3秒前
4秒前
甜甜的大香瓜完成签到 ,获得积分10
4秒前
宿帅帅完成签到,获得积分10
4秒前
4秒前
脑洞疼应助研友_7ZebY8采纳,获得10
5秒前
5秒前
7秒前
李爱国应助俭朴的发带采纳,获得30
7秒前
春二虫发布了新的文献求助10
7秒前
小夫发布了新的文献求助10
8秒前
无辜的银耳汤完成签到,获得积分10
8秒前
害羞安萱完成签到,获得积分20
8秒前
赘婿应助哎哟哎哟采纳,获得10
8秒前
hy发布了新的文献求助10
9秒前
9秒前
开朗若之完成签到 ,获得积分10
9秒前
简单之玉发布了新的文献求助10
9秒前
务实剑鬼完成签到,获得积分10
9秒前
10秒前
大气靳发布了新的文献求助10
11秒前
Hh完成签到,获得积分10
11秒前
12秒前
笑点低莛发布了新的文献求助10
12秒前
科研通AI6.1应助阳光c采纳,获得30
12秒前
wang完成签到,获得积分10
13秒前
浮游呦呦完成签到,获得积分10
13秒前
淡淡新竹发布了新的文献求助10
14秒前
14秒前
王颖超发布了新的文献求助10
14秒前
害羞安萱发布了新的文献求助10
14秒前
hy完成签到,获得积分20
14秒前
15秒前
马吉克发布了新的文献求助10
15秒前
女乔发布了新的文献求助10
16秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane Insecta, Polyneoptera 2000
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
基于非线性光纤环形镜的全保偏锁模激光器研究-上海科技大学 800
Pulse width control of a 3-phase inverter with non sinusoidal phase voltages 777
Signals, Systems, and Signal Processing 610
Research Methods for Business: A Skill Building Approach, 9th Edition 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6409356
求助须知:如何正确求助?哪些是违规求助? 8228540
关于积分的说明 17457312
捐赠科研通 5462304
什么是DOI,文献DOI怎么找? 2886340
邀请新用户注册赠送积分活动 1862745
关于科研通互助平台的介绍 1702227