化学
计算
数据库
金属
算法
有机化学
计算机科学
作者
Andrew J. P. White,Marco Gibaldi,Jake Burner,Rebeca Mayo,Tom K. Woo
摘要
"Computation-ready" metal-organic framework (MOF) databases provide essential raw data for high-throughput computational screening (HTS) and machine-learning approaches to materials discovery. However, the structural fidelity of these databases remains largely unquantified. We introduce MOSAEC, an algorithm that detects chemically invalid structures based on metal oxidation states. MOSAEC was manually validated against 14,796 MOF structures from the popular CoRE database and found to flag erroneous structures with 96% accuracy. Examination of 14 leading experimental and hypothetical MOF databases containing >1.9 million structures reveals structural error rates exceeding 40% in most cases. Analysis of 8 recent HTS studies which highlighted top-performing candidates shows that 52% of these structures were chemically invalid.
科研通智能强力驱动
Strongly Powered by AbleSci AI