计算机科学
机器翻译
人工智能
机器学习
多模态
自然语言处理
自动化
翻译(生物学)
生物化学
基因
信使核糖核酸
万维网
化学
机械工程
工程类
作者
Lin Li,Turghun Tayir,Yifeng Han,Xiaohui Tao,Juan D. Velásquez
标识
DOI:10.1016/j.inffus.2022.10.018
摘要
Machine translation is a popular automation approach for translating texts between different languages. Although traditionally it has a strong focus on natural language, images can potentially provide an additional source of information in machine translation. However, there are presently two challenges: (i) the lack of an effective fusion method to handle the triangular-mapping function between image, text, and semantic knowledge; and (ii) the accessibility of large-scale parallel corpus to train a model for generating accurate machine translations. To address these challenges, this work proposes an effective multimodality information fusion method for automated machine translation based on semi-supervised learning. The method fuses multimodality information, texts and images to deliver automated machine translation. Specifically, our objective fuses multimodalities with alignment in a multimodal attention network, which advances the method through the power of mapping text and image features to their semantic information with accuracy. Moreover, a semi-supervised learning method is utilized for its capability in using a small number of parallel corpus for supervised training on the basis of unsupervised training. Conducted on the Multi30k dataset, the experimental results shows the promising performance of our proposed fusion method compared with state-of-the-art approaches.
科研通智能强力驱动
Strongly Powered by AbleSci AI