数据库
生物催化
计算机科学
酶动力学
功能(生物学)
动力学
计算生物学
酶
数据挖掘
化学
生物
活动站点
物理
遗传学
反应机理
生物化学
量子力学
催化作用
作者
Bailu Yan,Xinchun Ran,Anvita Gollu,Zihao Cheng,Xiang Zhou,Yiwen Chen,Zhongyue Yang
标识
DOI:10.1021/acs.jcim.2c01139
摘要
Data-driven modeling has emerged as a new paradigm for biocatalyst design and discovery. Biocatalytic databases that integrate enzyme structure and function data are in urgent need. Here we describe IntEnzyDB as an integrated structure–kinetics database for facile statistical modeling and machine learning. IntEnzyDB employs a relational database architecture with a flattened data structure, which allows rapid data operation. This architecture also makes it easy for IntEnzyDB to incorporate more types of enzyme function data. IntEnzyDB contains enzyme kinetics and structure data from six enzyme commission classes. Using 1050 enzyme structure–kinetics pairs, we investigated the efficiency-perturbing propensities of mutations that are close or distal to the active site. The statistical results show that efficiency-enhancing mutations are globally encoded and that deleterious mutations are much more likely to occur in close mutations than in distal mutations. Finally, we describe a web interface that allows public users to access enzymology data stored in IntEnzyDB. IntEnzyDB will provide a computational facility for data-driven modeling in biocatalysis and molecular evolution.
科研通智能强力驱动
Strongly Powered by AbleSci AI