Uncertainty Quantification and Temperature Scaling Calibration for Protein-RNA Binding Site Prediction

校准 缩放比例 化学 核糖核酸 生物系统 计算生物学 生物物理学 生物 生物化学 统计 数学 几何学 基因
作者
Ximin Zeng,Hongmei Wang,Long Zhao,Yue Cheng,Dezhong Zhou,Shaoping Shi
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
标识
DOI:10.1021/acs.jcim.5c00556
摘要

The black-box nature of deep learning has increasingly drawn attention to the reliability and uncertainty of predictive models. Currently, several uncertainty quantification (UQ) methods have been proposed and successfully applied in the fields of molecules and proteins, effectively improving model prediction quality and interpretability. Protein-RNA binding represents a fundamental aspect of protein research. Accurate prediction of binding sites and ensuring the reliability of such predictions are crucial for various scientific endeavors. However, many of the existing computational methods have a single feature extraction and lack of UQ. To address these, we propose MGCA (multiscale graph convolutional networks, convolutional neural networks and attention) to better capture local and global information and achieve competitive results in predicting protein-RNA binding sites. Moreover, we launch a UQ study based on MGCA and five prevalent models to verify the robustness of the results. Specifically, we introduce the Expected Calibration Error (ECE) to assess the uncertainty of the models. Additionally, a novel split-bins screening method is proposed based on the ECE, aiming to investigate the practical impact of reducing uncertainty on the models. Finally, temperature scaling (TS) is used to calibrate model uncertainty without changing performance. Results show that the split-bins screening method reduces false positives (FP), and TS significantly decreases the model ECE. The split-bins screening method combined with TS can further reduce FP and improve precision. Our findings demonstrate that TS effectively reduces uncertainty in protein-RNA binding site prediction, and minimizing model uncertainty enhances prediction quality. The data and code can be available at https://github.com/trustcm/UQ-TS-Split-bins-RBP.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
CipherSage应助灰蓝采纳,获得10
刚刚
1秒前
diladila完成签到,获得积分10
1秒前
乐乐应助满意日记本采纳,获得10
1秒前
芋泥奶酪完成签到,获得积分10
2秒前
熊孩子完成签到,获得积分10
2秒前
2秒前
小蜜蜂完成签到,获得积分10
2秒前
4秒前
4秒前
共享精神应助layz采纳,获得10
4秒前
pupu发布了新的文献求助10
4秒前
txmjsn完成签到,获得积分10
5秒前
5秒前
Diiirrk完成签到,获得积分20
6秒前
6秒前
6秒前
Ava应助啦啦啦采纳,获得10
7秒前
8秒前
无忧皮卡发布了新的文献求助10
9秒前
JamesPei应助mumu采纳,获得10
9秒前
9秒前
jianli发布了新的文献求助10
10秒前
共享精神应助纳兰嫣然采纳,获得10
10秒前
无情静柏发布了新的文献求助10
11秒前
12秒前
葱姜蒜辣椒香菜我全要完成签到,获得积分10
13秒前
九玖酒发布了新的文献求助50
14秒前
大拿发布了新的文献求助10
14秒前
14秒前
耍酷小tutu完成签到,获得积分10
15秒前
火星上的大白菜完成签到,获得积分10
16秒前
16秒前
忧郁难敌发布了新的文献求助10
16秒前
why完成签到,获得积分10
17秒前
18秒前
18秒前
YY完成签到,获得积分10
18秒前
尘香如故完成签到 ,获得积分10
19秒前
高分求助中
Clinical Epidemiology: The Essentials, 6e 10000
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Graphene Handbook (2019 Edition) 800
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6544499
求助须知:如何正确求助?哪些是违规求助? 8333902
关于积分的说明 17858762
捐赠科研通 5653067
什么是DOI,文献DOI怎么找? 2937270
邀请新用户注册赠送积分活动 1913584
关于科研通互助平台的介绍 1776345