A deep learning-based method for the prediction of DNA interacting residues in a protein

人工智能 计算机科学 机器学习 深度学习 Boosting(机器学习) 梯度升压 RNA剪接 计算生物学 随机森林 基因 生物 遗传学 核糖核酸
作者
Sumeet Patiyal,Anjali Dhall,Gajendra P. S. Raghava
出处
期刊:Briefings in Bioinformatics [Oxford University Press]
卷期号:23 (5) 被引量:2
标识
DOI:10.1093/bib/bbac322
摘要

DNA-protein interaction is one of the most crucial interactions in the biological system, which decides the fate of many processes such as transcription, regulation and splicing of genes. In this study, we trained our models on a training dataset of 646 DNA-binding proteins having 15 636 DNA interacting and 298 503 non-interacting residues. Our trained models were evaluated on an independent dataset of 46 DNA-binding proteins having 965 DNA interacting and 9911 non-interacting residues. All proteins in the independent dataset have less than 30% of sequence similarity with proteins in the training dataset. A wide range of traditional machine learning and deep learning (1D-CNN) techniques-based models have been developed using binary, physicochemical properties and Position-Specific Scoring Matrix (PSSM)/evolutionary profiles. In the case of machine learning technique, eXtreme Gradient Boosting-based model achieved a maximum area under the receiver operating characteristics (AUROC) curve of 0.77 on the independent dataset using PSSM profile. Deep learning-based model achieved the highest AUROC of 0.79 on the independent dataset using a combination of all three profiles. We evaluated the performance of existing methods on the independent dataset and observed that our proposed method outperformed all the existing methods. In order to facilitate scientific community, we developed standalone software and web server, which are accessible from https://webs.iiitd.edu.in/raghava/dbpred.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
mark发布了新的文献求助10
1秒前
辉123发布了新的文献求助10
1秒前
酷波er应助现代代桃采纳,获得10
1秒前
SciGPT应助超帅pzc采纳,获得10
3秒前
damian完成签到,获得积分10
4秒前
桐桐应助dxtmm采纳,获得10
4秒前
5秒前
影啊影完成签到,获得积分10
6秒前
7秒前
天天快乐应助11采纳,获得10
8秒前
9秒前
钵钵鸡应助科研通管家采纳,获得10
11秒前
情怀应助科研通管家采纳,获得10
11秒前
bkagyin应助科研通管家采纳,获得10
11秒前
11秒前
cctv18应助科研通管家采纳,获得10
11秒前
cctv18应助科研通管家采纳,获得10
12秒前
Jasper应助科研通管家采纳,获得10
12秒前
FashionBoy应助hhhh采纳,获得10
12秒前
cctv18应助科研通管家采纳,获得10
12秒前
cctv18应助科研通管家采纳,获得10
12秒前
田様应助科研通管家采纳,获得10
12秒前
Orange应助科研通管家采纳,获得10
12秒前
今后应助科研通管家采纳,获得30
12秒前
shinysparrow应助科研通管家采纳,获得30
12秒前
13秒前
顾矜应助自信的冬日采纳,获得10
14秒前
15秒前
辉123完成签到,获得积分10
18秒前
dxtmm发布了新的文献求助10
19秒前
19秒前
Liucky完成签到,获得积分0
21秒前
明月欣完成签到,获得积分10
21秒前
ding应助妃莫笑采纳,获得10
23秒前
23秒前
入江直熠完成签到,获得积分10
23秒前
23秒前
fx完成签到,获得积分10
24秒前
Jinnianlun完成签到,获得积分10
24秒前
JL完成签到,获得积分10
24秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
De arte gymnastica. The art of gymnastics 600
少脉山油柑叶的化学成分研究 530
Mechanical Methods of the Activation of Chemical Processes 510
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2417601
求助须知:如何正确求助?哪些是违规求助? 2109822
关于积分的说明 5336147
捐赠科研通 1836918
什么是DOI,文献DOI怎么找? 914794
版权声明 561072
科研通“疑难数据库(出版商)”最低求助积分说明 489235