Design of highly functional genome editors by modeling the universe of CRISPR-Cas sequences

清脆的 基因组 计算生物学 宇宙 生物 计算机科学 进化生物学 遗传学 物理 基因 天文
作者
Jeffrey A. Ruffolo,Stephen Nayfach,Joseph P. Gallagher,Aadyot Bhatnagar,Joel Beazer,Rafat Hussain,Jordan Russ,Jennifer Yip,Emily Hill,Martin Pačesa,Alexander J. Meeske,Peter Cameron,Ali Madani
标识
DOI:10.1101/2024.04.22.590591
摘要

Gene editing has the potential to solve fundamental challenges in agriculture, biotechnology, and human health. CRISPR-based gene editors derived from microbes, while powerful, often show significant functional tradeoffs when ported into non-native environments, such as human cells. Artificial intelligence (AI) enabled design provides a powerful alternative with potential to bypass evolutionary constraints and generate editors with optimal properties. Here, using large language models (LLMs) trained on biological diversity at scale, we demonstrate the first successful precision editing of the human genome with a programmable gene editor designed with AI. To achieve this goal, we curated a dataset of over one million CRISPR operons through systematic mining of 26 terabases of assembled genomes and meta-genomes. We demonstrate the capacity of our models by generating 4.8x the number of protein clusters across CRISPR-Cas families found in nature and tailoring single-guide RNA sequences for Cas9-like effector proteins. Several of the generated gene editors show comparable or improved activity and specificity relative to SpCas9, the prototypical gene editing effector, while being 400 mutations away in sequence. Finally, we demonstrate an AI-generated gene editor, denoted as OpenCRISPR-1, exhibits compatibility with base editing. We release OpenCRISPR-1 publicly to facilitate broad, ethical usage across research and commercial applications.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
yumiao发布了新的文献求助10
3秒前
4秒前
5秒前
5秒前
5秒前
6秒前
9秒前
robust66发布了新的文献求助10
10秒前
10秒前
11秒前
Jasper应助清醒采纳,获得10
13秒前
damian发布了新的文献求助10
15秒前
ddd发布了新的文献求助10
16秒前
17秒前
Balance Man完成签到 ,获得积分10
17秒前
robust66完成签到,获得积分10
18秒前
19秒前
theyuyu完成签到,获得积分10
21秒前
时尚觅松发布了新的文献求助10
22秒前
24秒前
lxy完成签到,获得积分10
27秒前
江流有声发布了新的文献求助10
27秒前
无花果应助时尚觅松采纳,获得10
28秒前
28秒前
隔壁村花发布了新的文献求助10
32秒前
充电宝应助英俊小鼠采纳,获得10
37秒前
damian完成签到,获得积分10
38秒前
39秒前
Lucas应助漫漫采纳,获得10
43秒前
tong应助纨绔采纳,获得10
44秒前
活泼蓝发布了新的文献求助10
44秒前
CodeCraft应助眼药水采纳,获得10
45秒前
centlay应助诚心的三毒采纳,获得10
46秒前
gjww应助诚心的三毒采纳,获得10
46秒前
47秒前
强健的冰旋完成签到,获得积分10
48秒前
49秒前
着急的女侠完成签到,获得积分10
54秒前
54秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Plesiosaur extinction cycles; events that mark the beginning, middle and end of the Cretaceous 800
Recherches Ethnographiques sue les Yao dans la Chine du Sud 500
Two-sample Mendelian randomization analysis reveals causal relationships between blood lipids and venous thromboembolism 500
Chinese-English Translation Lexicon Version 3.0 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 460
Wisdom, Gods and Literature Studies in Assyriology in Honour of W. G. Lambert 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2394074
求助须知:如何正确求助?哪些是违规求助? 2097914
关于积分的说明 5286344
捐赠科研通 1825393
什么是DOI,文献DOI怎么找? 910154
版权声明 559943
科研通“疑难数据库(出版商)”最低求助积分说明 486433