计算机科学
构造(python库)
注释
命名实体识别
自然语言处理
人工智能
中医药
知识抽取
价值(数学)
情报检索
机器学习
医学
病理
经济
管理
程序设计语言
替代医学
任务(项目管理)
作者
Zijun Song,Wen Xu,Zhitao Liu,Liang Chen,Hongye Su
标识
DOI:10.1109/iciea58696.2023.10241595
摘要
Traditional Chinese medicine (TCM) documents have been handed down through the ages, containing rich theoretical knowledge and clinical experience. These unstructured data are the foundation for building the digital knowledge system of TCM. However, written in ancient Chinese, the TCM books have complex grammatical rules and terms which are different from modern medicine, inducing difficulty in entity annotation and recognition. In order to solve the problem of lacking labeled data, we construct a dataset with Wenbing Tiaobian, a classic work of TCM on the warm disease, identify six entities and annotate the book with the BIOES method. The BERT-BILSTM-CRF model is used to conduct experiments on the dataset with an F1 value of 91.4%. The results verify the effectiveness of the model in NER tasks and advance the construction of knowledge graphs in TCM.
科研通智能强力驱动
Strongly Powered by AbleSci AI