학술논문

Utilizing Deep Learning on Limited Mobile Speech Recordings for Detection of Obstructive Pulmonary Disease
Document Type
Conference
Source
2022 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) Engineering in Medicine & Biology Society (EMBC), 2022 44th Annual International Conference of the IEEE. :1338-1341 Jul, 2022
Subject
Bioengineering
Deep learning
Performance evaluation
Pathology
Pulmonary diseases
Wearable computers
Transfer learning
Neural networks
Language
ISSN
2694-0604
Abstract
Passive assessment of obstructive pulmonary disease has gained substantial interest over the past few years in the mobile and wearable computing communities. One of the promising approaches is speech-based pulmonary assessment wherein spontaneous or scripted speech is used to evaluate an individual's pulmonary condition. Recent approaches in this regard heavily rely on accurate speech activity segmentation and specific, hand-crafted features. In this paper, we present an end-to-end deep learning approach for detecting obstructive pulmonary disease. We leveraged transfer learning using a network pre-trained for a different audio-based task, and employed our own additional shallow network on top as a binary classifier to indicate if a given speech recording belongs to an asthma or COPD patient. The additional network was a fully connected neural net with 2 hidden layers, and this was evaluated on two real-world datasets. We demonstrated that the system can identify subjects with obtructive pulmonary disease using their speech with 88.3 % precision, 88.8 % recall and 88.3% F-1 score using 10-fold cross-validation. The system showed improved performance in identifying the most severely affected subgroup of patients in the dataset, with an average 93.6 % accuracy.