基因
遗传学
生物
起始密码子
密码子使用偏好性
计算生物学
分子生物学
基序列
基因组
作者
Daniel Wong,Kam‐Ho Wong,Sunjae Park,Grégory Boël,J.F. Hunt,Daniel P. Aalberts
标识
DOI:10.1016/j.jmb.2025.168965
摘要
The ability to overexpress proteins is valuable for biotechnology, but not all sequences are compatible with high yield. We previously analyzed the sequence features and mRNA folding stability of a large data set of 6,384 distinct gene constructs, and developed a model for protein yield. Our OPT.williams.edu server (1) predicts the probability an input sequence will produce protein at a high level when overexpressed in E. coli, and (2) returns optimized synonymous sequences designed to boost protein expression. Here we also present experimental evidence of the high yields of our OPT constructs for eight commercially produced proteins.
科研通智能强力驱动
Strongly Powered by AbleSci AI