NIST公司
素描
计算机科学
沸点
图形
人工神经网络
人工智能
卷积神经网络
数据集
机器学习
集合(抽象数据类型)
数据挖掘
算法
理论计算机科学
热力学
自然语言处理
物理
程序设计语言
作者
Chen Qu,Anthony J. Kearsley,Barry I. Schneider,Walid Keyrouz,Thomas C. Allison
标识
DOI:10.1016/j.jmgm.2022.108149
摘要
In this article, we describe training and validation of a machine learning model for the prediction of organic compound normal boiling points. Data are drawn from the experimental literature as captured in the NIST Thermodynamics Research Center (TRC) SOURCE Data Archival System. The machine learning model is based on a graph neural network approach, a methodology that has proven powerful when applied to a variety of chemical problems. Model input is extracted from a 2D sketch of the molecule, making the methodology suitable for rapid prediction of normal boiling points in a wide variety of scenarios. Our final model predicts normal boiling points within 6 K (corresponding to a mean absolute percent error of 1.32%) with sample standard deviation less than 8 K. Additionally, we found that our model robustly identifies errors in the input data set during the model training phase, thereby further motivating the utility of systematic data exploration approaches for data-related efforts.
科研通智能强力驱动
Strongly Powered by AbleSci AI