管道(软件)
计算机科学
关系(数据库)
模棱两可
合成生物学
任务(项目管理)
信息抽取
数据科学
系统生物学
情报检索
作者
Bridget T. McInnes,J. Stephen Downie,Yikai Hao,Jacob Jett,Kevin Keating,Gaurav Nakum,Sudhanshu Ranjan,Nicholas E. Rodriguez,Jiawei Tang,Du Xiang,Eric M. Young,Mai H. Nguyen
标识
DOI:10.1021/acssynbio.1c00611
摘要
Scientific articles contain a wealth of information about experimental methods and results describing biological designs. Due to its unstructured nature and multiple sources of ambiguity and variability, extracting this information from text is a difficult task. In this paper, we describe the development of the synthetic biology knowledge system (SBKS) text processing pipeline. The pipeline uses natural language processing techniques to extract and correlate information from the literature for synthetic biology researchers. Specifically, we apply named entity recognition, relation extraction, concept grounding, and topic modeling to extract information from published literature to link articles to elements within our knowledge system. Our results show the efficacy of each of the components on synthetic biology literature and provide future directions for further advancement of the pipeline.
科研通智能强力驱动
Strongly Powered by AbleSci AI