计算机科学
任务(项目管理)
词汇
语音识别
自然语言处理
编码(集合论)
代码转换
构造(python库)
人工智能
语言模型
语言学
哲学
管理
集合(抽象数据类型)
经济
程序设计语言
作者
Yuting Huang,Bi Zeng,Zhentao Lin,Jia Chen
标识
DOI:10.1109/icct56141.2022.10072473
摘要
Hong Kong Cantonese blends Cantonese and English due to its specific historical background and regional circumstance. This paper studies code-switching in Hong Kong-style Cantonese and proposes a Hong Kong Cantonese Speech Recognition (HKSR) method based on a multi-task rescoring strategy. Firstly, this work innovatively develops the Cantonese-English difference modelling unit to narrow modeling discrepancies between Cantonese and English, and simultaneously alleviate insufficient vocabulary issue caused by the lack of English in the data. Secondly, to better distinguish Cantonese from English, we construct a language identification(LID) subtask. Finally, to jointly train the LID and Automatic Speech Recognition(ASR), this paper develops a multi-task bilingual rescoring module based on U2 end-to-end model. We also investigate the impact of five different rescoring strategies, including multi-task bilingual rescoring, on Hong Kong Cantonese speech recognition. The experimental results demonstrate that HKSR combined with the multi-task bilingual rescoring strategy improves accuracy by 10%-49%
科研通智能强力驱动
Strongly Powered by AbleSci AI