化学
反应性(心理学)
催化作用
吞吐量
联轴节(管道)
高通量筛选
组合化学
有机化学
生物化学
医学
机械工程
电信
替代医学
病理
计算机科学
工程类
无线
作者
Seung Kyun Ha,Dipannita Kalyani,Michael S. West,Jessica Xu,Yu‐hong Lam,Thomas J. Struble,Spencer D. Dreher,Shane W. Krska,Stephen L. Buchwald,Klavs F. Jensen
摘要
This manuscript presents machine learning models for Pd-catalyzed C-N couplings constructed using a large, pharmaceutically relevant, structurally diverse dataset (4204 unique products) generated de novo using high-throughput experimentation. The dataset generation was enabled by the discovery of novel nanomole scale compatible automation friendly C-N coupling reaction conditions using LiOTMS as the base. The large dataset enabled the systematic evaluation of model performance using five different data-splitting strategies that were carefully designed to assess the models' ability to both interpolate and extrapolate. The models exhibit high predictive performance across all splits as gauged by standard metrics. In addition, the models predicted with high accuracy the outcome of validation libraries that were outside the scope of the training set. Employing these models in the context of medicinal chemistry campaigns should result in significant enrichment of successful C-N couplings.
科研通智能强力驱动
Strongly Powered by AbleSci AI