计算机科学
推论
重新调整用途
数据科学
管道(软件)
信息抽取
构造(python库)
鉴定(生物学)
关系抽取
情报检索
人工智能
工程类
植物
生物
程序设计语言
废物管理
作者
Yuan Zhang,Xin Sui,Feng Pan,Kaixian Yu,Keqiao Li,Shubo Tian,Arslan Erdengasileng,Qing Han,Wanjing Wang,Jianan Wang,Jian Wang,Donghu Sun,Henry Shu-Hung Chung,Jun Zhou,Eric S. Zhou,Benjamin Lee,Peili Zhang,Xing Qiu,Tingting Zhao,Jinfeng Zhang
标识
DOI:10.1101/2023.10.13.562216
摘要
Abstract To address the rapid growth of scientific publications and data in biomedical research, knowledge graphs (KGs) have become a critical tool for integrating large volumes of heterogeneous data to enable efficient information retrieval and automated knowledge discovery (AKD). However, transforming unstructured scientific literature into KGs remains a significant challenge, with previous methods unable to achieve human-level accuracy. In this study, we utilized an information extraction pipeline that won first place in the LitCoin NLP Challenge (2022) to construct a large-scale KG named iKraph using all PubMed abstracts. The extracted information matches human expert annotations and significantly exceeds the content of manually curated public databases. To enhance the KG’s comprehensiveness, we integrated relation data from 40 public databases and relation information inferred from high-throughput genomics data. This KG facilitates rigorous performance evaluation of AKD, which was infeasible in previous studies. We designed an interpretable, probabilistic-based inference method to identify indirect causal relations and applied it to real-time COVID-19 drug repurposing from March 2020 to May 2023. Our method identified 600-1400 candidate drugs per month, with one-third of those discovered in the first two months later supported by clinical trials or PubMed publications. These outcomes are very challenging to attain through alternative approaches that lack a thorough understanding of the existing literature. A cloud-based platform ( https://biokde.insilicom.com ) was developed for academic users to access this rich structured data and associated tools.
科研通智能强力驱动
Strongly Powered by AbleSci AI