Academic Article

A Real-Time DSP-Based System for Voice Activity Detection: Design and Implement
Document Type
Article
Text
Source
International Journal of Signal Processing, Image Processing and Pattern Recognition, 12/30/2013, Vol. 6, Issue 6, p. 27-40
Subject
VAD
FFT magnitudes
Gamma distribution
Rayleigh distribution
DSP
Language
English
ISSN
2005-4254
Abstract
Most of the noise in speech communication lines can be modeled as Gaussian white noise. Voice activity detection (VAD) in noisy environments is an important step in many speech signal processing algorithms. Unlike other VAD algorithms, this paper proposes a simple and novel VAD algorithm based on the probability density function (PDF) of the FFT magnitudes of both clean speech and Gaussian white noise. When the signal-to-noise ratio (SNR) is high enough, the method that uses the Gamma distribution to detect speech performs well, while the method that uses the Rayleigh distribution complements it at lower SNR. In addition, a threshold for deciding which method to use is presented, based on tests at different SNRs. Simulation results show that the proposed algorithm is efficient. Both the hardware and the software of a low-cost VAD system are introduced, with the proposed algorithm implemented on a digital signal processor (DSP). Each detection takes less than 100 ms, which makes the system suitable for real-time processing.
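The statistical idea behind the abstract can be illustrated with a minimal sketch. It is not the paper's decision rule, which the abstract does not specify, and it does not fit a Gamma distribution. It only checks one known property: the DFT magnitudes of Gaussian white noise are Rayleigh distributed, and for a Rayleigh variable the ratio of mean to RMS equals sqrt(pi)/2. The function names, the frame length of 256 samples, the tolerance of 0.08, and the three-tone test signal (a crude stand-in for voiced speech harmonics) are all assumptions made for illustration.

```python
import cmath
import math
import random

# For a Rayleigh-distributed variable, mean / RMS = sqrt(pi) / 2 (about 0.886).
RAYLEIGH_RATIO = math.sqrt(math.pi) / 2


def dft_magnitudes(frame):
    """Magnitudes of DFT bins 1 .. N/2-1 (DC and Nyquist bins are skipped
    because they are purely real and not Rayleigh distributed)."""
    n = len(frame)
    mags = []
    for k in range(1, n // 2):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        mags.append(abs(acc))
    return mags


def moment_ratio(mags):
    """Ratio of the mean magnitude to the RMS magnitude."""
    mean = sum(mags) / len(mags)
    rms = math.sqrt(sum(m * m for m in mags) / len(mags))
    return mean / rms


def looks_like_noise(frame, tolerance=0.08):
    """Hypothetical check: True if the magnitude statistics are close to
    what a Rayleigh distribution predicts. The tolerance is an assumption."""
    return abs(moment_ratio(dft_magnitudes(frame)) - RAYLEIGH_RATIO) < tolerance


random.seed(0)
N = 256  # assumed frame length

# Gaussian white noise frame.
noise = [random.gauss(0.0, 1.0) for _ in range(N)]

# Three sinusoids on exact DFT bins: energy concentrated in a few bins.
tones = [sum(math.sin(2 * math.pi * k * t / N) for k in (8, 16, 24))
         for t in range(N)]

print("noise frame Rayleigh-like:", looks_like_noise(noise))
print("tonal frame Rayleigh-like:", looks_like_noise(tones))
```

The plain DFT loop keeps the sketch dependency-free; a real-time DSP implementation would use an FFT routine instead.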