计算机科学
自然语言处理
班图语
词(群论)
人工智能
语音识别
分割
语音分割
机器翻译
文本分割
语言模型
语言学
哲学
作者
Pierre Godard,Marcely Zanon-Boito,Lucas Ondel,Alexandre Bérard,François Yvon,Aline Villavicencio,Laurent Besacier
标识
DOI:10.21437/interspeech.2018-1308
摘要
We present a first attempt to perform attentional word segmentation directly from the speech signal, with the final goal to automatically identify lexical units in a low-resource, unwritten language (UL). Our methodology assumes a pairing between recordings in the UL with translations in a well-resourced language. It uses Acoustic Unit Discovery (AUD) to convert speech into a sequence of pseudo-phones that is segmented using neural soft-alignments produced by a neural machine translation model. Evaluation uses an actual Bantu UL, Mboshi; comparisons to monolingual and bilingual baselines illustrate the potential of attentional word segmentation for language documentation.
科研通智能强力驱动
Strongly Powered by AbleSci AI