计算机科学
任务(项目管理)
水准点(测量)
选择(遗传算法)
人工智能
词(群论)
对象(语法)
情态动词
自然语言处理
注意力网络
情报检索
机器学习
化学
经济
管理
高分子化学
哲学
地理
语言学
大地测量学
作者
Yunfei Liu,Shengyang Li,Feihu Hu,Anqi Liu,Yanan Liu
出处
期刊:Communications in computer and information science
日期:2022-01-01
卷期号:: 83-94
被引量:2
标识
DOI:10.1007/978-981-19-7596-7_7
摘要
Multi-modal named entity recognition (MNER) is a multi-modal task aim to discover named entities in text with visual information. Existing MNER approaches model dense interactions between visual objects and textual words by designing co-attention mechanisms to achieve better accuracy. However, mapping interactions between each semantic unit (visual object and textual word) will force the model to calculate irrelevant information, which results in the model’s attention to be distracted. In this paper, to tackle the problem, we propose a novel model which concentrates the model’s attention by explicitly selecting the most relevant segments to predict entities. This method based on top-k selection can reduce the interference caused by irrelevant information and ultimately help the model to achieve better performance. Experimental results on benchmark dataset demonstrate the effectiveness of our MNER model.
科研通智能强力驱动
Strongly Powered by AbleSci AI