Convolutional neural network ensemble for Parkinson's disease detection from voice recordings

构音障碍卷积神经网络计算机科学任务（项目管理）语音识别人工智能人口灵敏度（控制系统）特征提取模式识别（心理学）机器学习听力学医学工程类管理经济环境卫生电子工程

作者

Máté Hireš,Matej Gazda,Peter Drotár,Nemuel Daniel Pah,Mohammod Abdul Motin,Dinesh Kumar

出处

期刊：Computers in Biology and Medicine [Elsevier BV]
日期：2021-11-09 卷期号：141: 105021-105021 被引量：96

链接

nih.govdoi.org

标识

DOI：10.1016/j.compbiomed.2021.105021

摘要

The computerized detection of Parkinson's disease (PD) will facilitate population screening and frequent monitoring and provide a more objective measure of symptoms, benefiting both patients and healthcare providers. Dysarthria is an early symptom of the disease and examining it for computerized diagnosis and monitoring has been proposed. Deep learning-based approaches have advantages for such applications because they do not require manual feature extraction, and while this approach has achieved excellent results in speech recognition, its utilization in the detection of pathological voices is limited. In this work, we present an ensemble of convolutional neural networks (CNNs) for the detection of PD from the voice recordings of 50 healthy people and 50 people with PD obtained from PC-GITA, a publicly available database. We propose a multiple-fine-tuning method to train the base CNN. This approach reduces the semantical gap between the source task that has been used for network pretraining and the target task by expanding the training process by including training on another dataset. Training and testing were performed for each vowel separately, and a 10-fold validation was performed to test the models. The performance was measured by using accuracy, sensitivity, specificity and area under the ROC curve (AUC). The results show that this approach was able to distinguish between the voices of people with PD and those of healthy people for all vowels. While there were small differences between the different vowels, the best performance was when/a/was considered; we achieved 99% accuracy, 86.2% sensitivity, 93.3% specificity and 89.6% AUC. This shows that the method has potential for use in clinical practice for the screening, diagnosis and monitoring of PD, with the advantage that vowel-based voice recordings can be performed online without requiring additional hardware.

求助该文献

最长约 10秒，即可获得该文献文件

Convolutional neural network ensemble for Parkinson's disease detection from voice recordings

今日热心研友