热稳定性
人工智能
机器学习
生物勘探
计算机科学
分类器(UML)
深度学习
稳健性(进化)
人工神经网络
蛋白质测序
中层
氨基酸残基
肽序列
生物
基因
生物化学
遗传学
细菌
植物
酶
作者
Xiaoyang Xiang,Jiaxuan Gao,Yanrui Ding
标识
DOI:10.1089/cmb.2023.0097
摘要
Using wet experimental methods to discover new thermophilic proteins or improve protein thermostability is time-consuming and expensive. Machine learning methods have shown powerful performance in the study of protein thermostability in recent years. However, how to make full use of multiview sequence information to predict thermostability effectively is still a challenge. In this study, we proposed a deep learning-based classifier named DeepPPThermo that fuses features of classical sequence features and deep learning representation features for classifying thermophilic and mesophilic proteins. In this model, deep neural network (DNN) and bi-long short-term memory (Bi-LSTM) are used to mine hidden features. Furthermore, local attention and global attention mechanisms give different importance to multiview features. The fused features are fed to a fully connected network classifier to distinguish thermophilic and mesophilic proteins. Our model is comprehensively compared with advanced machine learning algorithms and deep learning algorithms, proving that our model performs better. We further compare the effects of removing different features on the classification results, demonstrating the importance of each feature and the robustness of the model. Our DeepPPThermo model can be further used to explore protein diversity, identify new thermophilic proteins, and guide directed mutations of mesophilic proteins.
科研通智能强力驱动
Strongly Powered by AbleSci AI