语音识别
计算机科学
振动
信号(编程语言)
特征(语言学)
人工智能
预处理器
模式识别(心理学)
特征提取
声学
语言学
哲学
物理
程序设计语言
作者
Qianqian Zhang,Kun Tong
出处
期刊:IEEE Sensors Journal
[Institute of Electrical and Electronics Engineers]
日期:2023-12-15
卷期号:23 (24): 31380-31388
标识
DOI:10.1109/jsen.2023.3321987
摘要
The vocal cord vibration signal carries as much rich phonetic information as the speech signal. However, conventional automatic speech recognition (ASR) technology cannot effectively deal with vocal cord vibration signal. In this paper, a wearable throat speech detection sensor (TSDS) based on graphene sheets is designed. Subsequently, a large number of Chinese and English words are collected, and a dataset of vocal cord vibration for recognition tasks is established. Applying the knowledge of transfer learning, we use XGBoost applied to bearing fault detection as a source model, combined with feature engineering technology PCA, and propose an extreme gradient boosting model fused with principal component analysis for efficient identification of vocal fold vibration signals. The model reduces the data dimension, saves the operation time, and can automatically extract the characteristic parameters with a large contribution to the vocal cord vibration signal at the same time.
科研通智能强力驱动
Strongly Powered by AbleSci AI