학술논문

Whitening Transformation of i-vectors in Closed-Set Speaker Verification of Children
Document Type
Conference
Source
2023 10th International Conference on Signal Processing and Integrated Networks (SPIN) Signal Processing and Integrated Networks (SPIN), 2023 10th International Conference on. :243-248 Mar, 2023
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Fields, Waves and Electromagnetics
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Analytical models
Speech recognition
Signal processing
Probabilistic logic
Regulation
Safety
Speaker recognition
Automatic speaker verification
Gaussian mixture model
Probabilistic linear discriminant analysis
Whitening transformation
Zero-phase component analysis.
Language
ISSN
2688-769X
Abstract
Automatic speaker recognition may be beneficial for children in a wide variety of disciplines, such as child education, security and safety. However, owing to a variety of issues in children’s speech, including immature vocal tracts, native phonology, long pauses, and hesitations, many speech technology applications have trouble recognizing non-native children. In order to assess how the children’s speech impacts the system, the primary objective of this study is to build a closed-set child speaker verffication system for non-native English speakers in both textdependent and text-independent tasks. This study focuses on a Gaussian probabilistic linear discriminant analysis (GPLDA) model that was trained employing i-vectors and several whitening methods. Cosine similarity scoring (CSS) and GPLDA scoring are used to evaluate the claimed outcomes. Additionally, it has been shown that in closedset speaker verification, the zero-phase component analysis (ZCA) transformation outperformed the other whitening transformations.