普通话
语音识别
语调(文学)
计算机科学
特征(语言学)
集合(抽象数据类型)
人工智能
字错误率
语言学
哲学
程序设计语言
摘要
To effectively help second language (L2) Chinese learners to produce tones correctly in computer assisted language learning (CALL), tone recognition of continuous speech is necessary. Because of the complex tone variation in continuous speech, this paper proposed TAM-BLSTM tone recognition model. Firstly, the generation model, target approximation model (TAM) is used to simulate fundamental frequency (f0) from original f0 contour in the unit of prosodic words, and the TAM parameters for each Chinese character are derived. Then BLSTM model with attention mechanism is set up with input feature of the TAM parameters and basic acoustic features, such as statistical f0 parameters, vowel duration, to solve the problem of tone detection of Mandarin continuous speech. Finally, the trained tone detection model is applied to the tone error detection of the L2 learners. The experimental results with Biaobei corpus show that the accuracy of the feature set combined with TAM parameters is 2.3% higher than that of using basic acoustic features alone, and the overall accuracy of ATT-BLSTM network model is higher than that based on ATT-LSTM.
科研通智能强力驱动
Strongly Powered by AbleSci AI