重采样
可用性
机器学习
采样(信号处理)
人工智能
化学
大数据
计算机科学
数据挖掘
计算机视觉
滤波器(信号处理)
人机交互
作者
Shengzhou Li,Ayako Nakata
标识
DOI:10.1093/chemle/upae090
摘要
Abstract Materials science research benefits from the powerful machine-learning (ML) surrogate models, but it is also limited by the implicit requirement for sufficiently big and balanced data distribution for ML. In this paper, we propose a model to obtain more credible results for small and imbalanced materials data sets as well as chemical knowledge. Taking 2 bandgaps imbalanced data sets as instances, we demonstrate the usability and performance of our model compared with common ML models with normal sampling and resampling methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI