Leveraging structure for enzyme function prediction: methods, opportunities, and challenges

计算生物学 功能(生物学) 生物 鉴定(生物学) 蛋白质功能 基因组 生物化学 遗传学 基因 植物
作者
Matthew P. Jacobson,Chakrapani Kalyanaraman,Suwen Zhao,Boxue Tian
出处
期刊:Trends in Biochemical Sciences [Elsevier BV]
卷期号:39 (8): 363-371 被引量:34
标识
DOI:10.1016/j.tibs.2014.05.006
摘要

•Of the >50 million protein sequences, <1% have experimentally determined functions. •Protein structures can provide clues to function, such as the substrates of enzymes. •Homology modeling and ligand docking algorithms can help infer function from structure. •Recent successes include discovery of novel metabolites, enzymes, and pathways. The rapid growth of the number of protein sequences that can be inferred from sequenced genomes presents challenges for function assignment, because only a small fraction (currently <1%) has been experimentally characterized. Bioinformatics tools are commonly used to predict functions of uncharacterized proteins. Recently, there has been significant progress in using protein structures as an additional source of information to infer aspects of enzyme function, which is the focus of this review. Successful application of these approaches has led to the identification of novel metabolites, enzyme activities, and biochemical pathways. We discuss opportunities to elucidate systematically protein domains of unknown function, orphan enzyme activities, dead-end metabolites, and pathways in secondary metabolism. The rapid growth of the number of protein sequences that can be inferred from sequenced genomes presents challenges for function assignment, because only a small fraction (currently <1%) has been experimentally characterized. Bioinformatics tools are commonly used to predict functions of uncharacterized proteins. Recently, there has been significant progress in using protein structures as an additional source of information to infer aspects of enzyme function, which is the focus of this review. Successful application of these approaches has led to the identification of novel metabolites, enzyme activities, and biochemical pathways. We discuss opportunities to elucidate systematically protein domains of unknown function, orphan enzyme activities, dead-end metabolites, and pathways in secondary metabolism. a computational technique that builds an atomic model of a target protein using its sequence and an experimental 3D structure of a homologous protein (called the 'template'). The quality of a homology model depends on the accuracy of the sequence alignment between target and template, which varies (loosely) with the sequence identity (roughly speaking, pairwise identity higher than 40% is ideal, and lower than 25% is poor). a computational technique that predicts and ranks the binding poses of small molecule ligands to receptors (e.g., proteins). Docking usually comprises a sampling method that generates possible binding poses of a ligand in a binding site, and a scoring function that ranks these poses. Most scoring functions are empirical, and give only a crude estimate of the binding free energy of a ligand. biochemical pathways to produce organic molecules (i.e., secondary metabolites) that are not absolutely required for the survival of the organism. There are five particularly prevalent classes of secondary metabolite: isoprenoids, alkaloids, polyketides, nonribosomal peptides, and ribosomally synthesized and post-translationally modified peptides. Secondary metabolites are often restricted to a narrow set of species and have important ecological roles for the organisms that produce them. Many secondary metabolites are bioactive (antibacterial, anticancer, antifungal, antiviral, antioxidant, anti-inflammatory, antiparasitic, antimalaria, cytotoxic, etc.) and have been used as drugs and drug leads. an effort to determine the 3D, atomic-level structure of every protein encoded by a genome through a combination of high-throughput experimental and modeling approaches. The determination of a protein structure though a structural genomics effort often precedes knowledge of its function, motivating the development of methods to infer function from structure.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
1秒前
3秒前
欣慰友梅完成签到,获得积分10
3秒前
青春发布了新的文献求助10
3秒前
4秒前
5秒前
善学以致用应助XXHH采纳,获得10
5秒前
5秒前
欣慰友梅发布了新的文献求助10
6秒前
7秒前
8秒前
上官若男应助沉默的凝云采纳,获得30
8秒前
科研通AI5应助una采纳,获得10
9秒前
9秒前
野火197完成签到,获得积分10
9秒前
淡然善斓完成签到,获得积分10
10秒前
10秒前
Leeon完成签到,获得积分10
10秒前
灵灵发布了新的文献求助20
11秒前
朱朱朱朱完成签到,获得积分10
11秒前
11秒前
11秒前
11秒前
12秒前
12秒前
pebble完成签到,获得积分10
13秒前
岁大爷发布了新的文献求助20
13秒前
猎空完成签到,获得积分10
13秒前
微笑笑南发布了新的文献求助10
13秒前
隐形曼青应助小姜采纳,获得10
14秒前
yuanweisun发布了新的文献求助30
15秒前
yuu发布了新的文献求助10
15秒前
星川发布了新的文献求助30
17秒前
17秒前
共享精神应助朱朱朱朱采纳,获得10
17秒前
潘潘潘完成签到 ,获得积分10
17秒前
blue应助Edwin采纳,获得20
18秒前
Jasper应助lkx采纳,获得10
19秒前
20秒前
高分求助中
Applied Survey Data Analysis (第三版, 2025) 800
Narcissistic Personality Disorder 700
Handbook of Experimental Social Psychology 500
The Martian climate revisited: atmosphere and environment of a desert planet 500
建国初期十七年翻译活动的实证研究. 建国初期十七年翻译活动的实证研究 400
Transnational East Asian Studies 400
Towards a spatial history of contemporary art in China 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3846643
求助须知:如何正确求助?哪些是违规求助? 3389216
关于积分的说明 10556235
捐赠科研通 3109602
什么是DOI,文献DOI怎么找? 1713825
邀请新用户注册赠送积分活动 824934
科研通“疑难数据库(出版商)”最低求助积分说明 775135