학술논문

Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis

Document Type

article

Author

Thomas Soroski; Thiago da Cunha Vasco; Sally Newton-Mason; Saffrin Granby; Caitlin Lewis; Anuj Harisinghani; Matteo Rizzo; Cristina Conati; Gabriel Murray; Giuseppe Carenini; Thalia S Field; Hyeju Jang

Source

JMIR Aging, Vol 5, Iss 3, p e33460 (2022)

Subject

Geriatrics
RC952-954.6

Language

English

ISSN

2561-7605

Abstract

BackgroundSpeech data for medical research can be collected noninvasively and in large volumes. Speech analysis has shown promise in diagnosing neurodegenerative disease. To effectively leverage speech data, transcription is important, as there is valuable information contained in lexical content. Manual transcription, while highly accurate, limits the potential scalability and cost savings associated with language-based screening. ObjectiveTo better understand the use of automatic transcription for classification of neurodegenerative disease, namely, Alzheimer disease (AD), mild cognitive impairment (MCI), or subjective memory complaints (SMC) versus healthy controls, we compared automatically generated transcripts against transcripts that went through manual correction. MethodsWe recruited individuals from a memory clinic (“patients”) with a diagnosis of mild-to-moderate AD, (n=44, 30%), MCI (n=20, 13%), SMC (n=8, 5%), as well as healthy controls (n=77, 52%) living in the community. Participants were asked to describe a standardized picture, read a paragraph, and recall a pleasant life experience. We compared transcripts generated using Google speech-to-text software to manually verified transcripts by examining transcription confidence scores, transcription error rates, and machine learning classification accuracy. For the classification tasks, logistic regression, Gaussian naive Bayes, and random forests were used. ResultsThe transcription software showed higher confidence scores (P.05) for speech from healthy controls compared with patients. Classification models using human-verified transcripts significantly (P

Online Access

Full Text (ProQuest Central) Open Access (DOAJ) Open Access (EBSCO) Web of Science Scopus Find it@PNU

이메일

부산대학교 도서관

Online Access

메일 발송