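Computer science
Pinyin
Named-entity recognition
Artificial intelligence
Natural language processing
Feature (linguistics)
Entity linking
Representation (politics)
Word (group theory)
Boundary (topology)
Task (project management)
Government (linguistics)
Chinese characters
Linguistics
Knowledge base
Mathematical analysis
Philosophy
Mathematics
Management
Politics
Political science
Law
Economics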
Authors
Zhenxiang Sun, Runyuan Sun, Zhifeng Liang, Zhuang Su, Yongxin Yu, Shuainan Wu
Identifier
DOI: 10.1007/978-981-99-4752-2_55
Abstract
Pre-trained language models have ushered in a new era of named entity recognition, but additional task-relevant knowledge is still needed to improve their performance on specific problems. In Chinese government named entity recognition in particular, most entities are long and have vague boundaries, and this uncertainty in entity length and boundaries makes entities difficult to recognize and prone to misidentification. To address this problem, this paper proposes a Chinese named entity recognition model based on multi-feature fusion, in which lexical features, word boundary features, and pinyin features are fused through a multi-head attention mechanism to enhance the model's semantic representation of government texts. The paper also studies the contribution of the different features to entity recognition and finds that pinyin features have unique advantages in recognizing government entities. This study provides new ideas and methods for the research and application of Chinese government entity recognition, and offers some insights into named entity recognition in other language domains. The experimental results show that the proposed model outperforms the baseline model.
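The abstract does not say how the word boundary features are encoded. As an illustration only, the sketch below uses the common BMES convention (Begin, Middle, End, Single) to derive one boundary tag per character from a pre-segmented sentence. The function name `boundary_tags` and the example segmentation are assumptions made for this sketch, not details taken from the paper.

```python
def boundary_tags(words):
    """Return one BMES boundary tag per character of a pre-segmented sentence.

    Assumed convention (not confirmed by the paper):
    B = first character of a multi-character word, M = interior character,
    E = last character, S = single-character word.
    """
    tags = []
    for word in words:
        if not word:
            continue  # ignore empty tokens
        if len(word) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(word) - 2) + ["E"])
    return tags


# Hypothetical segmentation of a long government entity name.
words = ["国家", "发展", "和", "改革", "委员会"]
chars = [ch for word in words for ch in word]
tags = boundary_tags(words)

# Each character is paired with its boundary tag; in a model such tags would
# typically be embedded and combined with the other per-character features.
for ch, tag in zip(chars, tags):
    print(ch, tag)
```

Tags of this kind give each character an explicit signal about where words start and end, which is one plausible way to supply the boundary information that long entities with vague boundaries lack.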