计算机科学
命名实体识别
编码器
知识库
领域(数学)
领域知识
条件随机场
变压器
人工智能
知识抽取
领域(数学分析)
机器学习
数据挖掘
数学
数学分析
物理
管理
量子力学
电压
纯数学
经济
任务(项目管理)
操作系统
作者
Yafei Liu,Siqi Wei,Haijun Huang,Quirino Lai,Mengshan Li,Lixin Guan
标识
DOI:10.1016/j.eswa.2023.121103
摘要
In the agricultural industry, there is still a need for improved systematic integration of knowledge related to citrus pests and diseases. Creating a knowledge base focused on citrus pests and diseases and developing a knowledge map would enable the formation of a system providing accurate and convenient prevention and control methods for citrus growers. However, the majority of existing data on citrus pests and diseases is unstructured, making identifying named entities from large volumes of unstructured text data particularly important for creating a knowledge map. In this paper, we propose a training model based on Bidirectional Encoder Representation from Transformers (BERT), combined with Bidirectional Long and Short Term Memory Networks (BiLSTM) and Conditional Random Field (CRF), to extract specific entity categories from unstructured data. When tested on a created dataset, the model achieved an accuracy rate of 0.9423 and an f1 value of 0.8048. The results demonstrate that the proposed method can be applied to extract specific entity categories from text data related to citrus pests and diseases, laying a solid foundation for the subsequent construction of a knowledge map. This paper introduces a method for named entity recognition in the field of citrus pests and diseases, which could serve as a reference for constructing knowledge maps in other fields and improving the utilization of domain knowledge.
科研通智能强力驱动
Strongly Powered by AbleSci AI