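Computer science
Convolutional neural network
Scope (computer science)
Generalization
Property (philosophy)
Invariant (physics)
Artificial intelligence
Equivariant map
Key (lock)
Rotation (mathematics)
Subspace topology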
Stability (learning theory)
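Mutation
Machine learning
Algorithm
Mathematics
Computer security
Pure mathematics
Programming language
Chemistry
Mathematical physics
Biochemistry
Philosophy
Mathematical analysis
Epistemology
Gene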
Authors
He Chen,Yifan Cheng,Jianqiang Dong,Jie Mao,Xin Wang,Yuan Gao,Yuchao Li,Chengzhi Wang,Qiong Wu
Identifier
DOI:10.1101/2024.02.07.579261
Abstract
Predicting the properties of proteins is an important step in protein engineering. It determines the subspace of mutations considered for protein modification, which is critical to the success of a project but relies heavily on the knowledge and experience of scientists. In this study, we propose a novel deep 3D-CNN model, Eq3DCNN, specifically designed for local environment-related tasks in protein engineering. Eq3DCNN takes basic atom descriptors and their coordinates as inputs and uses customized data augmentations to improve training efficiency. To improve the generalization capability of the features extracted by Eq3DCNN, we incorporated a rotation-equivariant module to obtain rotation-invariant features. Using cross-validation with different data-splitting strategies, and under zero-shot prediction scenarios, we demonstrate that Eq3DCNN outperformed other 3D-CNN models in stability prediction and also performed well on other prediction tasks, such as binding pocket and secondary structure prediction. Our results also identified the key factors that contribute to the model’s accuracy and the scope of its applications. These findings may help scientists design better mutation experiments and increase the success rate in protein engineering.
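The abstract does not describe how the rotation-equivariant module is built, so the following is only a minimal sketch of the general idea of turning an orientation-dependent feature into a rotation-invariant one. It assumes a much simpler stand-in technique: averaging a feature over the 24 axis-aligned rotations of a cube. The function names (`cube_rotations`, `rotate`, `raw_feature`, `invariant_feature`), the linear filter, and the atom coordinates are all hypothetical and are not taken from Eq3DCNN.

```python
import itertools


def cube_rotations():
    """Return the 24 proper rotations of a cube as 3x3 signed permutation matrices."""
    matrices = []
    for perm in itertools.permutations(range(3)):
        # Parity of the permutation: +1 for even, -1 for odd.
        inversions = sum(perm[i] > perm[j] for i in range(3) for j in range(i + 1, 3))
        parity = -1 if inversions % 2 else 1
        for signs in itertools.product((1, -1), repeat=3):
            # Determinant = parity * product of signs; keep +1 (no reflections).
            if parity * signs[0] * signs[1] * signs[2] == 1:
                matrices.append(
                    [[signs[r] if c == perm[r] else 0 for c in range(3)] for r in range(3)]
                )
    return matrices


def rotate(points, matrix):
    """Apply a 3x3 matrix to every (x, y, z) point."""
    return [
        tuple(sum(matrix[r][c] * p[c] for c in range(3)) for r in range(3))
        for p in points
    ]


def raw_feature(points):
    """Hypothetical orientation-dependent feature: ReLU of a fixed linear filter."""
    return sum(max(0, x + 2 * y + 3 * z) for x, y, z in points)


def invariant_feature(points):
    """Average the raw feature over all 24 rotations of the input."""
    rotations = cube_rotations()
    return sum(raw_feature(rotate(points, m)) for m in rotations) / len(rotations)


# Toy integer atom coordinates centred on the residue of interest (made-up values).
atoms = [(1, 0, 2), (-1, 3, 0), (2, -2, 1)]
turned = rotate(atoms, [[0, -1, 0], [1, 0, 0], [0, 0, 1]])  # 90 degrees about z

print(raw_feature(atoms), raw_feature(turned))              # differ with orientation
print(invariant_feature(atoms), invariant_feature(turned))  # agree
```

Because the 24 rotations form a group, rotating the input by any one of them only reorders the terms in the average, which is why the pooled feature is unchanged. This covers only the discrete axis-aligned rotations; invariance under arbitrary 3D rotations requires a different construction.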