特拉-
质谱法
计算机科学
比例(比率)
化学
物理
色谱法
操作系统
量子力学
作者
Konstantin S. Kozlov,Daniil A. Boiko,Julia V. Burykina,Valentina V. Ilyushenkova,Alexander Yu. Kostyukovich,E. D. Patil,Valentine P. Ananikov
标识
DOI:10.1038/s41467-025-56905-8
摘要
The accumulation of large datasets by the scientific community has surpassed the capacity of traditional processing methods, underscoring the critical need for innovative and efficient algorithms capable of navigating through extensive existing experimental data. Addressing this challenge, our study introduces a machine learning (ML)-powered search engine specifically tailored for analyzing tera-scale high-resolution mass spectrometry (HRMS) data. This engine harnesses a novel isotope-distribution-centric search algorithm augmented by two synergistic ML models, assisting with the discovery of hitherto unknown chemical reactions. This methodology enables the rigorous investigation of existing data, thus providing efficient support for chemical hypotheses while reducing the need for conducting additional experiments. Moreover, we extend this approach with baseline methods for automated reaction hypothesis generation. In its practical validation, our approach successfully identified several reactions, unveiling previously undescribed transformations. Among these, the heterocycle-vinyl coupling process within the Mizoroki-Heck reaction stands out, highlighting the capability of the engine to elucidate complex chemical phenomena. Mass spectrometry generates vast amounts of data in chemistry labs. Here, authors developed a machine learning-driven search engine that analyzes archived data to discover chemical reactions without performing additional experiments.
科研通智能强力驱动
Strongly Powered by AbleSci AI