计算机科学
机器翻译
回指(语言学)
自然语言处理
共指
人工智能
串联(数学)
背景(考古学)
基于实例的机器翻译
翻译(生物学)
特征(语言学)
连贯性(哲学赌博策略)
分辨率(逻辑)
语言学
古生物学
生物化学
数学
哲学
物理
组合数学
量子力学
信使核糖核酸
基因
生物
化学
作者
Elena Voita,Pavel Serdyukov,Rico Sennrich,Ivan Titov
摘要
Standard machine translation systems process sentences in isolation and hence ignore extra-sentential information, even though extended context can both prevent mistakes in ambiguous cases and improve translation coherence. We introduce a context-aware neural machine translation model designed in such way that the flow of information from the extended context to the translation model can be controlled and analyzed. We experiment with an English-Russian subtitles dataset, and observe that much of what is captured by our model deals with improving pronoun translation. We measure correspondences between induced attention distributions and coreference relations and observe that the model implicitly captures anaphora. It is consistent with gains for sentences where pronouns need to be gendered in translation. Beside improvements in anaphoric cases, the model also improves in overall BLEU, both over its context-agnostic version (+0.7) and over simple concatenation of the context and source sentences (+0.6).
科研通智能强力驱动
Strongly Powered by AbleSci AI