An Overview of Protein Function Prediction Methods: A Deep Learning Perspective

蛋白质功能预测 计算机科学 功能(生物学) 蛋白质功能 注释 深度学习 机器学习 人工智能 数据挖掘 生物 生物化学 进化生物学 基因
作者
Stefano Toppo,Federico Bianca,Emilio Ispano,Enrico Lavezzo
出处
期刊:Current Bioinformatics [Bentham Science]
卷期号:18 (8): 621-630
标识
DOI:10.2174/1574893618666230505103556
摘要

Abstract: Predicting the function of proteins is a major challenge in the scientific community, particularly in the post-genomic era. Traditional methods of determining protein functions, such as experiments, are accurate but can be resource-intensive and time-consuming. The development of Next Generation Sequencing (NGS) techniques has led to the production of a large number of new protein sequences, which has increased the gap between available raw sequences and verified annotated sequences. To address this gap, automated protein function prediction (AFP) techniques have been developed as a faster and more cost-effective alternative, aiming to maintain the same accuracy level. : Several automatic computational methods for protein function prediction have recently been developed and proposed. This paper reviews the best-performing AFP methods presented in the last decade and analyzes their improvements over time to identify the most promising strategies for future methods. : Identifying the most effective method for predicting protein function is still a challenge. The Critical Assessment of Functional Annotation (CAFA) has established an international standard for evaluating and comparing the performance of various protein function prediction methods. In this study, we analyze the best-performing methods identified in recent editions of CAFA. These methods are divided into five categories based on their principles of operation: sequence-based, structure-based, combined-based, ML-based and embeddings-based. : After conducting a comprehensive analysis of the various protein function prediction methods, we observe that there has been a steady improvement in the accuracy of predictions over time, mainly due to the implementation of machine learning techniques. The present trend suggests that all the bestperforming methods will use machine learning to improve their accuracy in the future. : We highlight the positive impact that the use of machine learning (ML) has had on protein function prediction. Most recent methods developed in this area use ML, demonstrating its importance in analyzing biological information and making predictions. Despite these improvements in accuracy, there is still a significant gap compared with experimental evidence. The use of new approaches based on Deep Learning (DL) techniques will probably be necessary to close this gap, and while significant progress has been made in this area, there is still more work to be done to fully realize the potential of DL.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
208应助小巧的秋白采纳,获得10
1秒前
Shadow完成签到 ,获得积分10
2秒前
uu完成签到,获得积分10
5秒前
8秒前
12秒前
超棒的发布了新的文献求助10
14秒前
16秒前
gsdf发布了新的文献求助10
17秒前
19秒前
幸福果汁发布了新的文献求助10
21秒前
飞天星宇发布了新的文献求助10
22秒前
kk发布了新的文献求助10
22秒前
华仔应助惊鸿一面采纳,获得10
23秒前
璐璐完成签到 ,获得积分10
25秒前
YINZHE应助memem1采纳,获得10
26秒前
顾矜应助memem1采纳,获得10
26秒前
28秒前
风犬少年完成签到,获得积分10
29秒前
张华完成签到 ,获得积分10
31秒前
852应助xiaowu采纳,获得10
32秒前
33秒前
飞天星宇完成签到,获得积分10
33秒前
专注忆寒发布了新的文献求助10
41秒前
44秒前
water完成签到,获得积分10
44秒前
46秒前
zzz完成签到,获得积分10
48秒前
Autin发布了新的文献求助10
50秒前
aa发布了新的文献求助10
53秒前
格局太小完成签到 ,获得积分20
53秒前
水木飞雪完成签到,获得积分10
56秒前
共享精神应助blUe采纳,获得10
58秒前
华仔应助Amon采纳,获得50
58秒前
58秒前
yxy999发布了新的文献求助10
59秒前
王大锤完成签到,获得积分10
1分钟前
陈居居完成签到,获得积分10
1分钟前
Dante发布了新的文献求助10
1分钟前
1分钟前
1分钟前
高分求助中
请在求助之前详细阅读求助说明!!!! 20000
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 700
[Lambert-Eaton syndrome without calcium channel autoantibodies] 520
Pressing the Fight: Print, Propaganda, and the Cold War 500
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2471096
求助须知:如何正确求助?哪些是违规求助? 2137771
关于积分的说明 5447301
捐赠科研通 1861745
什么是DOI,文献DOI怎么找? 925893
版权声明 562740
科研通“疑难数据库(出版商)”最低求助积分说明 495275