计算机科学
变压器
发音
自然语言处理
编码器
人工智能
命名实体识别
语音识别
编码
汉字
印度
Glyph(数据可视化)
可视化
语言学
中国
管理
政治学
任务(项目管理)
生物化学
法学
操作系统
化学
电压
经济
量子力学
哲学
物理
基因
作者
Baohua Zhang,Jiejin Cai,Huaping Zhang,Jianyun Shang
标识
DOI:10.1016/j.ipm.2023.103314
摘要
Many Chinese NER models only focus on lexical and radical information, ignoring the fact that there are also certain rules for the pronunciation of Chinese entities. In this paper, we propose VisPhone, which incorporates Chinese characters’ Phonetic features into Transformer Encoder along with the Lattice and Visual features. We present the common rules for the pronunciation of Chinese entities and explore the most appropriate method to encode it. VisPhone uses two identical cross transformer encoders to fuse the visual and phonetic features of the input characters with the text embedding. A selective fusion module is used to get the final features. We conducted experiments on four well-known Chinese NER benchmark datasets: OntoNotes4.0, MSRA, Resume, and Weibo, with F1 scores of 82.63%, 96.07%, 96.26%, 70.79% respectively, improving the performance by 0.79%, 0.32%, 0.39%, and 3.47%. Our ablation experiments have also demonstrated the effectiveness of VisPhone.
科研通智能强力驱动
Strongly Powered by AbleSci AI