Structure-based sampling and self-correcting machine learning for accurate calculations of potential energy surfaces and vibrational levels

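Rovibrational spectroscopy; Ab initio; Sampling (signal processing); Computer science; Energy (signal processing); Kernel (algebra); Set (abstract data type); Algorithm; Artificial intelligence; Machine learning; Mathematics; Physics; Molecule; Statistics; Quantum mechanics; Discrete mathematics; Filter (signal processing); Computer vision; Programming language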
Authors
Pavlo O. Dral,A. Owens,S. N. Yurchenko,Walter Thiel
Source
Journal: Journal of Chemical Physics [American Institute of Physics]
Volume (issue): 146 (24) Citations: 143
Identifier
DOI:10.1063/1.4989536
Abstract

We present an efficient approach for generating highly accurate molecular potential energy surfaces (PESs) using self-correcting, kernel ridge regression (KRR) based machine learning (ML). We introduce structure-based sampling to automatically assign nuclear configurations from a pre-defined grid to the training and prediction sets, respectively. Accurate high-level ab initio energies are required only for the points in the training set, while the energies for the remaining points are provided by the ML model with negligible computational cost. The proposed sampling procedure is shown to be superior to random sampling and also eliminates the need for training several ML models. Self-correcting machine learning has been implemented such that each additional layer corrects errors from the previous layer. The performance of our approach is demonstrated in a case study on a published high-level ab initio PES of methyl chloride with 44 819 points. The ML model is trained on sets of different sizes and then used to predict the energies for tens of thousands of nuclear configurations within seconds. The resulting datasets are utilized in variational calculations of the vibrational energy levels of CH3Cl. By using both structure-based sampling and self-correction, the size of the training set can be kept small (e.g., 10% of the points) without any significant loss of accuracy. In ab initio rovibrational spectroscopy, it is thus possible to reduce the number of computationally costly electronic structure calculations through structure-based sampling and self-correcting KRR-based machine learning by up to 90%.
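
The workflow described in the abstract (select a training subset from a pre-defined grid, fit a KRR model, then let further layers correct the remaining error) can be illustrated with a small, self-contained sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: a 1D Morse curve stands in for the ab initio energies, greedy farthest-point selection stands in for structure-based sampling, each layer is fitted to the residual of the previous layers, and the kernel widths and regularization are arbitrary values.

```python
import math

def morse(r, d_e=1.0, a=1.0, r_e=1.0):
    """Toy 1D potential standing in for costly ab initio energies (assumption)."""
    return d_e * (1.0 - math.exp(-a * (r - r_e))) ** 2

def farthest_point_sample(xs, n_train):
    """Greedy pick: repeatedly add the grid point farthest from the chosen set."""
    chosen = [0]
    min_dist = [abs(x - xs[0]) for x in xs]
    while len(chosen) < n_train:
        nxt = max(range(len(xs)), key=lambda i: min_dist[i])
        chosen.append(nxt)
        min_dist = [min(d, abs(x - xs[nxt])) for d, x in zip(min_dist, xs)]
    return chosen

def gauss_kernel(x, y, sigma):
    return math.exp(-((x - y) ** 2) / (2.0 * sigma ** 2))

def solve(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def krr_fit(x_train, y_train, sigma, lam):
    """Regression coefficients alpha = (K + lam * I)^-1 y."""
    k = [[gauss_kernel(xi, xj, sigma) + (lam if i == j else 0.0)
          for j, xj in enumerate(x_train)] for i, xi in enumerate(x_train)]
    return solve(k, y_train)

def krr_predict(x_train, alpha, sigma, x):
    return sum(a * gauss_kernel(x, xt, sigma) for a, xt in zip(alpha, x_train))

def fit_layers(x_train, y_train, sigmas, lam):
    """Each layer is trained on the residual left by the layers before it."""
    layers, residual = [], list(y_train)
    for sigma in sigmas:
        alpha = krr_fit(x_train, residual, sigma, lam)
        layers.append((sigma, alpha))
        residual = [r - krr_predict(x_train, alpha, sigma, x)
                    for r, x in zip(residual, x_train)]
    return layers

def predict(x_train, layers, x):
    """Sum the contributions of all layers."""
    return sum(krr_predict(x_train, alpha, sigma, x) for sigma, alpha in layers)

def rmse(pred, ref):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(pred, ref)) / len(ref))

# Pre-defined grid of 101 geometries; roughly 10% go into the training set.
grid = [0.5 + 0.025 * i for i in range(101)]
energies = [morse(r) for r in grid]
train_idx = farthest_point_sample(grid, 10)
test_idx = [i for i in range(len(grid)) if i not in train_idx]
x_tr = [grid[i] for i in train_idx]
y_tr = [energies[i] for i in train_idx]

# Hyperparameters below are illustrative guesses, not tuned values.
one_layer = fit_layers(x_tr, y_tr, sigmas=[0.5], lam=1e-6)
two_layers = fit_layers(x_tr, y_tr, sigmas=[0.5, 0.1], lam=1e-6)

rmse_train_1 = rmse([predict(x_tr, one_layer, x) for x in x_tr], y_tr)
rmse_train_2 = rmse([predict(x_tr, two_layers, x) for x in x_tr], y_tr)
rmse_test = rmse([predict(x_tr, two_layers, grid[i]) for i in test_idx],
                 [energies[i] for i in test_idx])
print("training RMSE, 1 layer :", rmse_train_1)
print("training RMSE, 2 layers:", rmse_train_2)
print("held-out RMSE, 2 layers:", rmse_test)
```

In this toy setting only the 10 training geometries need a reference energy; the other 91 grid points are filled in by the model. A real application would use a multidimensional molecular descriptor in place of the scalar coordinate and would tune the hyperparameters rather than fix them by hand.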
