机器翻译
计算机科学
人工智能
融合
翻译(生物学)
自然语言处理
工程制图
计算机视觉
工程类
语言学
生物化学
基因
信使核糖核酸
哲学
化学
作者
Chenghao He,Quzong Gesang,Nuo Qun,Gadeng Luosang,Nyima Tashi
标识
DOI:10.1109/iotaai62601.2024.10692893
摘要
This article explores a Tibetan-Chinese machine translation model based on multimodal alignment of images and texts, using the Resnet50 model for feature extraction of images, the TIP-LAS Tibetan Segmentation Lexical Annotation System tool for lexical annotation and segmentation of Tibetan texts, and the Stuttering tool for lexical annotation and segmentation of Chinese texts. Experiments on the same graphic Tibetan-Chinese dataset and CMXT2022 Tibetan-Chinese dataset show that the Tibetan-Chinese machine translation model based on graphic multimodal alignment is higher than the plain text Tibetan-Chinese machine translation translation-model based on Transformer by 2.09 BLEU points and 2.33 BLEU points, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI