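Convolutional neural network
Computer science
Artificial intelligence
Protein function prediction
UniProt
Deep learning
Gene ontology
Artificial neural network
Machine learning
Function (biology)
Pattern recognition (psychology)
Protein function
Gene
Biology
Genetics
Gene expression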
Authors
Mohamed E.M. Elhaj-Abdou,Hassan Eldib,Amr El-Helw,Mohamed El-Habrouk
Identifier
DOI:10.1016/j.compbiolchem.2021.107584
Abstract
A protein's amino acid sequence can be used to determine its functions. However, determining the function of even a single protein experimentally requires many resources and a tremendous amount of time. Computational intelligence methods such as deep learning have been shown to predict protein functions. This paper proposes a hybrid deep neural network model, named Deep_CNN_LSTM_GO, to predict an unknown protein's functions from its sequence. Deep_CNN_LSTM_GO integrates a Convolutional Neural Network (CNN) with a Long Short-Term Memory (LSTM) neural network to learn features from amino acid sequences and output Gene Ontology (GO) annotations. The Gene Ontology represents protein functions in three sub-ontologies: Molecular Function (MF), Biological Process (BP), and Cellular Component (CC). The proposed model has been trained and tested on the UniProt-SwissProt dataset. A further test has been carried out on Critical Assessment of Functional Annotation (CAFA) data for the three sub-ontologies. The proposed model outperforms other methods proposed in the field on three evaluation metrics (Fmax, Smin, and AUPR) in all three sub-ontologies (MF, BP, CC).
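The abstract reports Fmax without defining it. As a minimal sketch, assuming the protein-centric definition commonly used in CAFA-style evaluations (the maximum over score thresholds of the harmonic mean of average precision and average recall), it could be computed as below. This is not the authors' evaluation code; the function name `fmax`, the input layout, the threshold grid, and the term identifiers "GO:A", "GO:B", "GO:C" are all hypothetical and chosen only for illustration.

```python
def fmax(y_true, y_score, thresholds=None):
    """Protein-centric Fmax (illustrative sketch, not the paper's code).

    y_true  : list of sets, the true GO terms of each protein
    y_score : list of dicts, predicted score per GO term for each protein
    """
    if thresholds is None:
        # Assumed threshold grid: 0.01, 0.02, ..., 0.99
        thresholds = [i / 100 for i in range(1, 100)]

    # Recall is averaged over proteins that have at least one true term.
    n_truth = sum(1 for truth in y_true if truth)
    best = 0.0
    for t in thresholds:
        precisions = []
        recall_sum = 0.0
        for truth, scores in zip(y_true, y_score):
            predicted = {term for term, s in scores.items() if s >= t}
            hits = len(predicted & truth)
            # Precision is averaged only over proteins with a prediction at t.
            if predicted:
                precisions.append(hits / len(predicted))
            if truth:
                recall_sum += hits / len(truth)
        if not precisions or n_truth == 0:
            continue
        pr = sum(precisions) / len(precisions)
        rc = recall_sum / n_truth
        if pr + rc > 0:
            best = max(best, 2 * pr * rc / (pr + rc))
    return best


# Hypothetical toy data: two proteins, placeholder term identifiers.
y_true = [{"GO:A", "GO:B"}, {"GO:C"}]
y_score = [
    {"GO:A": 0.9, "GO:B": 0.4, "GO:C": 0.2},
    {"GO:C": 0.8, "GO:A": 0.3},
]
print(fmax(y_true, y_score))
```

In practice the metric would be computed separately for each sub-ontology (MF, BP, CC), usually after propagating annotations to ancestor terms, a step this sketch omits.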