亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

FusionESP: Improved Enzyme–Substrate Pair Prediction by Fusing Protein and Chemical Knowledge

底物特异性 基质(水族馆) 化学 计算机科学 计算生物学 人工智能 生物化学 生物 生态学
作者
Zhenjiao Du,Weimin Fu,Xiaolong Guo,Doina Caragea,Yonghui Li
出处
期刊:Journal of Chemical Information and Modeling [American Chemical Society]
卷期号:65 (6): 2806-2817 被引量:10
标识
DOI:10.1021/acs.jcim.4c02357
摘要

To reduce the cost of the experimental characterization of the potential substrates for enzymes, machine learning prediction models offer an alternative solution. Pretrained language models, as powerful approaches for protein and molecule representation, have been employed in the development of enzyme-substrate prediction models, achieving promising performance. In addition to continuing improvements in language models, effectively fusing encoders to handle multimodal prediction tasks is critical for further enhancing model performance by using available representation methods. Here, we present FusionESP, a multimodal architecture that integrates protein and chemistry language models with two independent projection heads and a contrastive learning strategy for predicting enzyme-substrate pairs. Our best model achieved state-of-the-art performance with an accuracy of 94.77% on independent test data and exhibited better generalization capacity while requiring fewer computational resources and training data, compared to previous studies of a fine-tuned encoder or employing more encoders. It also confirmed our hypothesis that embeddings of positive pairs are closer to each other in a high-dimension space, while negative pairs exhibit the opposite trend. Our ablation studies showed that the projection heads played a crucial role in performance enhancement, while the contrastive learning strategy further improved the projection heads' capacity in classification tasks. The proposed architecture is expected to be further applied to enhance performance in additional multimodality prediction tasks in biology. A user-friendly web server of FusionESP is established and freely accessible at https://rqkjkgpsyu.us-east-1.awsapprunner.com/.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
天天天晴完成签到 ,获得积分10
1秒前
zjy关闭了zjy文献求助
6秒前
吴羊羽发布了新的文献求助10
8秒前
ChencanFang完成签到,获得积分10
13秒前
吴羊羽完成签到 ,获得积分10
33秒前
可意发布了新的文献求助10
34秒前
666完成签到,获得积分20
45秒前
ccc完成签到 ,获得积分10
50秒前
51秒前
忧郁山蝶发布了新的文献求助10
56秒前
无极微光应助可意采纳,获得20
57秒前
星辰大海应助科研通管家采纳,获得10
57秒前
华仔应助科研通管家采纳,获得10
57秒前
58秒前
58秒前
1分钟前
无限千万完成签到 ,获得积分10
1分钟前
可意完成签到,获得积分10
1分钟前
1分钟前
忧郁山蝶发布了新的文献求助10
1分钟前
如花发布了新的文献求助30
1分钟前
温软完成签到 ,获得积分10
1分钟前
搞怪汝燕发布了新的文献求助10
1分钟前
1分钟前
2分钟前
2分钟前
2分钟前
焦糖布丁发布了新的文献求助10
2分钟前
ataybabdallah完成签到,获得积分10
2分钟前
蒽女士完成签到,获得积分10
2分钟前
无花果应助蒽女士采纳,获得10
2分钟前
斜阳完成签到 ,获得积分10
2分钟前
2分钟前
如花完成签到,获得积分10
2分钟前
2分钟前
妮可发布了新的文献求助10
2分钟前
2分钟前
3分钟前
翩璸发布了新的文献求助10
3分钟前
3分钟前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Competition Law: Cases and Materials, 5th edition 500
Introduction to Cosmetic Formulation and Technology, 2nd Edition 400
Petrology and Plate Tectonics,2025 400
Burger's Medicinal Chemistry and Drug Discovery 400
A Step-by-Step Guide to Qualitative Data Coding 2nd Edition 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6706679
求助须知:如何正确求助?哪些是违规求助? 8447382
关于积分的说明 18040380
捐赠科研通 5947429
什么是DOI,文献DOI怎么找? 2991299
邀请新用户注册赠送积分活动 1967237
关于科研通互助平台的介绍 1913457