标杆管理
水准点(测量)
离群值
计算机科学
试验装置
机器学习
图形
人工神经网络
集合(抽象数据类型)
聚类分析
人工智能
数据挖掘
模式识别(心理学)
理论计算机科学
业务
大地测量学
营销
程序设计语言
地理
作者
Kanishka Singh,Jannes Münchmeyer,Leon Weber,Ulf Leser,Annika Bande
标识
DOI:10.1021/acs.jctc.2c00255
摘要
Machine learning (ML) approaches have demonstrated the ability to predict molecular spectra at a fraction of the computational cost of traditional theoretical chemistry methods while maintaining high accuracy. Graph neural networks (GNNs) are particularly promising in this regard, but different types of GNNs have not yet been systematically compared. In this work, we benchmark and analyze five different GNNs for the prediction of excitation spectra from the QM9 dataset of organic molecules. We compare the GNN performance in the obvious runtime measurements, prediction accuracy, and analysis of outliers in the test set. Moreover, through TMAP clustering and statistical analysis, we are able to highlight clear hotspots of high prediction errors as well as optimal spectra prediction for molecules with certain functional groups. This in-depth benchmarking and subsequent analysis protocol lays down a recipe for comparing different ML methods and evaluating dataset quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI