财产(哲学)
计算机科学
自然语言处理
数据科学
认识论
哲学
作者
Wonseok Lee,Yeonghun Kang,Taeun Bae,Jihan Kim
出处
期刊:Cornell University - arXiv
日期:2024-03-31
标识
DOI:10.48550/arxiv.2404.13053
摘要
This research was focused on the efficient collection of experimental Metal-Organic Framework (MOF) data from scientific literature to address the challenges of accessing hard-to-find data and improving the quality of information available for machine learning studies in materials science. Utilizing a chain of advanced Large Language Models (LLMs), we developed a systematic approach to extract and organize MOF data into a structured format. Our methodology successfully compiled information from more than 40,000 research articles, creating a comprehensive and ready-to-use dataset. The findings highlight the significant advantage of incorporating experimental data over relying solely on simulated data for enhancing the accuracy of machine learning predictions in the field of MOF research.
科研通智能强力驱动
Strongly Powered by AbleSci AI