计算机科学
机器学习
人工智能
蛋白质工程
工作流程
深度学习
稳健性(进化)
杠杆(统计)
生物
数据库
生物化学
基因
酶
作者
Mason Minot,Sai T. Reddy
标识
DOI:10.1016/j.cels.2023.12.003
摘要
Machine learning-guided protein engineering is rapidly progressing; however, collecting high-quality, large datasets remains a bottleneck. Directed evolution and protein engineering studies often require extensive experimental processes to eliminate noise and label protein sequence-function data. Meta learning has proven effective in other fields in learning from noisy data via bi-level optimization given the availability of a small dataset with trusted labels. Here, we leverage meta learning approaches to overcome noisy and under-labeled data and expedite workflows in antibody engineering. We generate yeast display antibody mutagenesis libraries and screen them for target antigen binding followed by deep sequencing. We then create representative learning tasks, including learning from noisy training data, positive and unlabeled learning, and learning out of distribution properties. We demonstrate that meta learning has the potential to reduce experimental screening time and improve the robustness of machine learning models by training with noisy and under-labeled training data.
科研通智能强力驱动
Strongly Powered by AbleSci AI