Computer science
Modal
Machine translation
Artificial intelligence
Graph
Sentence
Natural language processing
Scene graph
Pattern recognition (psychology)
Theoretical computer science
Chemistry
Polymer chemistry
Rendering (computer graphics)
Authors
Youtan Yin,Jiali Zeng,Jinsong Su,Chulun Zhou,Fandong Meng,Jie Zhou,Degen Huang,Jiebo Luo
Identifier
DOI:10.1016/j.artint.2023.103986
Abstract
As an important extension of conventional text-only neural machine translation (NMT), multi-modal neural machine translation (MNMT) aims to translate input source sentences paired with images into the target language. Although many MNMT models have been proposed to perform multi-modal semantic fusion, they do not consider fine-grained semantic correspondences between semantic units of different modalities (i.e., words and visual objects), which can be exploited to refine multi-modal representation learning via fine-grained semantic interactions. To address this issue, we propose a graph-based multi-modal fusion encoder for NMT. Concretely, we first employ a unified multi-modal graph to represent the input sentence and image, in which the multi-modal semantic units are treated as the nodes of the graph, connected by two kinds of edges with different semantic relationships. Then, we stack multiple graph-based multi-modal fusion layers that iteratively conduct intra- and inter-modal interactions to learn node representations. Finally, via an attention mechanism, we induce a multi-modal context from the top node representations for the decoder. In particular, we introduce a progressive contrastive learning strategy based on the multi-modal graph to refine the training of our proposed model, where hard negative samples are introduced gradually. To evaluate our model, we conduct experiments on commonly used datasets. Experimental results and analysis show that our MNMT model obtains significant improvements over competitive baselines, achieving state-of-the-art performance on the Multi30K dataset.
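The fusion mechanism the abstract describes can be illustrated with a minimal sketch: nodes for words and visual objects share one graph, and each fusion layer lets every node attend over intra-modal edges (same modality) and inter-modal edges (across modalities). This is not the authors' implementation; the layer shapes, the additive combination of the two attention passes, and all names below are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def masked_attention(q, k, v, mask):
    # Scaled dot-product attention restricted to edges allowed by `mask`.
    d = q.shape[-1]
    scores = q @ k.T / np.sqrt(d)
    scores = np.where(mask, scores, -1e9)  # block non-neighbors
    return softmax(scores, axis=-1) @ v

def fusion_layer(nodes, intra_mask, inter_mask):
    # One graph-based fusion step (illustrative): each node attends first
    # to neighbors of its own modality, then to the other modality; here
    # the two results are simply summed as the updated representation.
    intra = masked_attention(nodes, nodes, nodes, intra_mask)
    inter = masked_attention(nodes, nodes, nodes, inter_mask)
    return intra + inter

rng = np.random.default_rng(0)
n_words, n_objects, d = 4, 3, 8
nodes = rng.standard_normal((n_words + n_objects, d))

# Unified multi-modal graph: word nodes 0..3, visual-object nodes 4..6,
# with fully connected intra- and inter-modal edge sets for simplicity.
modality = np.array([0] * n_words + [1] * n_objects)
intra_mask = modality[:, None] == modality[None, :]
inter_mask = ~intra_mask

out = nodes
for _ in range(3):  # stack multiple fusion layers
    out = fusion_layer(out, intra_mask, inter_mask)
print(out.shape)  # (7, 8)
```

The top-layer node representations `out` would then be summarized by a decoder-side attention mechanism into a multi-modal context, as the abstract outlines.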