StackDPPIV: A novel computational approach for accurate prediction of dipeptidyl peptidase IV (DPP-IV) inhibitory peptides

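Feature (linguistics); Artificial intelligence; Machine learning; Computer science; Discriminant; Probabilistic logic; Dipeptidyl peptidase; Pattern recognition (psychology); Data mining; Chemistry; Linguistics; Biochemistry; Philosophy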
Authors
Phasit Charoenkwan, Chanin Nantasenamat, Md Mehedi Hasan, Mohammad Ali Moni, Píetro Lió, Balachandran Manavalan, Watshara Shoombuatong
Source
Journal: Methods [Elsevier BV]
Volume/Issue: 204: 189-198 Cited by: 58
Identifier
DOI: 10.1016/j.ymeth.2021.12.001
Abstract

The development of efficient and effective bioinformatics tools and pipelines for identifying peptides with dipeptidyl peptidase IV (DPP-IV) inhibitory activity from large-scale protein datasets is of great importance for the discovery and development of promising antidiabetic drugs. In this study, we present a novel stacking-based ensemble learning predictor (termed StackDPPIV) designed for the identification of DPP-IV inhibitory peptides. Unlike the existing method, which relies on a single feature encoding, we combined five popular machine learning algorithms with ten different feature encodings from multiple perspectives to generate a pool of baseline models. Subsequently, the probabilistic features derived from these baseline models were systematically integrated and treated as new feature representations. Finally, in order to improve the predictive performance, a genetic algorithm based on the self-assessment-report was used to determine a set of informative probabilistic features, and the optimal set was then used to develop the final meta-predictor (StackDPPIV). Experimental results demonstrated that StackDPPIV outperformed its constituent baseline models on both the training and independent datasets. Furthermore, StackDPPIV achieved an accuracy of 0.891, an MCC of 0.784 and an AUC of 0.961 on the independent test, which were 9.4%, 19.0% and 11.4% higher, respectively, than those of the existing method. Feature analysis demonstrated that our feature representations had more discriminative ability than conventional feature descriptors, which highlights that the combination of different features was essential for the performance improvement. To make the proposed predictor accessible, we have built a user-friendly online web server at http://pmlabstack.pythonanywhere.com/StackDPPIV.
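
The stacking idea described in the abstract (baseline models trained on different feature encodings, whose predicted probabilities become the inputs of a meta-predictor) can be illustrated with a small sketch. This is not the authors' implementation. Everything in it is an assumption made for illustration: the two encodings (amino acid composition and dipeptide composition), the nearest-centroid baseline learner, the logistic-regression meta-learner, and the toy peptides and labels are all invented here. The genetic-algorithm feature selection step is omitted.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def aac(seq):
    """Amino acid composition: fraction of each of the 20 residues."""
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]

def dpc(seq):
    """Dipeptide composition (400 values); assumes len(seq) >= 2."""
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    return [pairs.count(a + b) / len(pairs)
            for a in AMINO_ACIDS for b in AMINO_ACIDS]

def fit_centroids(X, y):
    """Toy baseline learner (assumption): one mean vector per class."""
    cents = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        cents[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return cents

def predict_proba(cents, x):
    """Probability-like score for class 1 from distances to both centroids."""
    d0 = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, cents[0])))
    d1 = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, cents[1])))
    return 0.5 if d0 + d1 == 0 else d0 / (d0 + d1)

def probabilistic_features(seqs, y, encoders, k=2):
    """Out-of-fold class-1 scores: one column per (encoding, learner) pair."""
    n = len(seqs)
    feats = [[0.0] * len(encoders) for _ in range(n)]
    for j, enc in enumerate(encoders):
        X = [enc(s) for s in seqs]
        for fold in range(k):
            train = [i for i in range(n) if i % k != fold]
            test = [i for i in range(n) if i % k == fold]
            cents = fit_centroids([X[i] for i in train], [y[i] for i in train])
            for i in test:
                feats[i][j] = predict_proba(cents, X[i])
    return feats

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(F, y, lr=0.1, epochs=200):
    """Meta-learner (assumption): logistic regression trained by SGD."""
    w, b = [0.0] * len(F[0]), 0.0
    for _ in range(epochs):
        for x, t in zip(F, y):
            g = sigmoid(b + sum(wi * xi for wi, xi in zip(w, x))) - t
            w = [wi - lr * g * xi for wi, xi in zip(w, x)]
            b -= lr * g
    return w, b

def train_stack(seqs, y, encoders):
    """Fit the meta-learner on out-of-fold scores, then refit baselines on all data."""
    w, b = fit_logistic(probabilistic_features(seqs, y, encoders), y)
    baselines = [fit_centroids([enc(s) for s in seqs], y) for enc in encoders]
    return baselines, w, b

def stack_predict(model, encoders, seq):
    """Score a new peptide: baseline scores first, then the meta-learner."""
    baselines, w, b = model
    f = [predict_proba(c, enc(seq)) for c, enc in zip(baselines, encoders)]
    return sigmoid(b + sum(wi * fi for wi, fi in zip(w, f)))

# Hypothetical toy data: sequences and labels are invented for illustration.
ENCODERS = [aac, dpc]
SEQS = ["APAP", "GPLP", "GGKE", "DEKK", "PPAV", "LPPG", "STGG", "KKDE"]
LABELS = [1, 1, 0, 0, 1, 1, 0, 0]

MODEL = train_stack(SEQS, LABELS, ENCODERS)
print(stack_predict(MODEL, ENCODERS, "APLP"))
```

The meta-learner is fitted on out-of-fold scores so that it never sees a baseline's prediction for a peptide that baseline was trained on. In a fuller pipeline, a feature selection step such as the genetic algorithm mentioned in the abstract would choose a subset of the probabilistic feature columns before the meta-learner is fitted.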