机器学习
计算机科学
人工智能
爆速
起爆
爆炸物
趋同(经济学)
核(代数)
集合(抽象数据类型)
算法
数学
化学
组合数学
经济增长
经济
有机化学
程序设计语言
作者
Daniel C. Elton,Zois Boukouvalas,Mark S. Butrico,Mark Fuge,Peter W. Chung
标识
DOI:10.26434/chemrxiv.5883157.v2
摘要
We present a proof of concept that machine learning techniques can be used to predict the properties of CNOHF energetic molecules from their molecular structures. We focus on a small but diverse dataset consisting of 109 molecular structures spread across ten compound classes. Up until now, candidate molecules for energetic materials have been screened using predictions from expensive quantum simulations and thermochemical codes. We present a comprehensive comparison of machine learning models and several molecular featurization methods - sum over bonds, custom descriptors, Coulomb matrices, bag of bonds, and fingerprints. The best featurization was sum over bonds (bond counting), and the best model was kernel ridge regression. Despite having a small data set, we obtain acceptable errors and Pearson correlations for the prediction of detonation pressure, detonation velocity, explosive energy, heat of formation, density, and other properties out of sample. By including another dataset with 309 additional molecules in our training we show how the error can be pushed lower, although the convergence with number of molecules is slow. Our work paves the way for future applications of machine learning in this domain, including automated lead generation and interpreting machine learning models to obtain novel chemical insights.
科研通智能强力驱动
Strongly Powered by AbleSci AI