计算机科学
语义学(计算机科学)
人工智能
功能(生物学)
代表(政治)
嵌入
机器学习
数据挖掘
特征(语言学)
语义相似性
班级(哲学)
人工神经网络
文字嵌入
自然语言处理
语义网络
模式识别(心理学)
比例(比率)
相似性(几何)
作者
Hanwen Zhou,Te Zhang,Zhaohong Deng,Guanjin Wang,Zhisheng Wei,Lei Wang,Xiaoyong Pan,Hong‐Bin Shen,Dong‐Jun Yu,Jing Wu
标识
DOI:10.1109/tcbbio.2025.3644035
摘要
Comprehending biological reproduction and cellular metabolism is facilitated by the Enzyme Commission, which matches protein sequences to the biochemical reactions they catalyse through EC numbers. In recent years, several methods have been proposed for predicting enzyme function. However, these methods still encounter challenges. Firstly, traditional methods for manually designing enzyme features are complex and cumbersome, lacking an effective generalized method for embedding enzyme sequences. Secondly, the distribution gap between different enzymes is significant, which resulting in existing methods struggling to predict multilevel enzyme functions. Thirdly, traditional enzyme function prediction models only extract single view feature of enzyme, so there is still room for further improving the ability of these models to extract enzyme data. To address these challenges, a new multilevel enzyme function prediction model (SMENET) based on multi-view semantics is proposed. This method uses protein large language model to extract semantic information. Subsequently, this semantic information is fed into multiple information extraction network modules, followed by using Biologic Sematic Attention to integrate these views' information. Finally, a multi-view adaptive fusion network is designed to extract the best common representation between multiple semantic views. Extensive experiments were conducted on multiple datasets to validate the effectiveness of SMENET.
科研通智能强力驱动
Strongly Powered by AbleSci AI