吸附
极限学习机
集成学习
支持向量机
感知器
梯度升压
化学
随机森林
Boosting(机器学习)
特征选择
线性回归
人工智能
机器学习
计算机科学
数学
人工神经网络
物理化学
作者
Hong Zhu,Hui Guo,Zhihui Liu,Zhao‐Xu Chen
摘要
Metal-based catalysts are widely used in many kinds of reactions, including the hydrogenation of CO2 to alcohols. Adsorption energies of key intermediates have often been used as descriptors in high-throughput catalyst screening. However, establishing machine learning models to accurately predict adsorption energies of widely spanned species is still challenging. In the present article, we explored the predictive power of sure independence screening and sparsifying operator (SISSO), multilayer perceptron regression (MLPR), random forest regression (RFR), kernel ridge regression (KRR), support vector regression (SVR), eXtreme Gradient Boosting (XGBoost), and 20 ensemble machine learning (ML) methods for adsorption energies of 57 species involved in CO2 hydrogenation to ethanol using surface features, adsorbate features, and adsorption site features. The results show that SISSO and the five base ML methods cannot furnish models with the maximum absolute error (MAX) comparable to DFT errors. On the other hand, the MAX of most two-component and three-component ensemble ML methods is less than 0.3 eV, and the KRR+MLPR+XGBoost ensemble ML model performs the best, with the mean absolute error being 0.03 eV and MAX of only 0.17 eV. Feature importance analysis reveals that the condensed local softness is the most important feature, and there are linear relations between the condensed local softness and adsorption energies of 10 C1 species on all considered surfaces. The present work shows that ensemble ML methods outperform the base ML methods for predicting adsorption energies of widely ranged species with satisfactory accuracy and deserve further studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI