학술논문

Direction of arrival estimation for speech sources using fourth order cross cumulants
Document Type
Conference
Source
2008 IEEE International Symposium on Circuits and Systems (ISCAS) Circuits and Systems (ISCAS), 2008 IEEE International Symposium on. :1696-1699 May, 2008
Subject
Components, Circuits, Devices and Systems
Communication, Networking and Broadcast Technologies
Engineered Materials, Dielectrics and Plasmas
Direction of arrival estimation
Speech enhancement
Acoustic sensors
Statistics
Sensor arrays
Gaussian noise
Speech processing
Signal processing algorithms
Probability density function
Laplace equations
Language
ISSN
0271-4302
2158-1525
Abstract
In many applications where speech separation and enhancement is of interest, e.g. conferencing systems, mobile phones and hearing aids, accurate speaker localization is important. This paper presents an alternative criteria for the well known Steered Response Power with Phase Transform (SRP-PHAT) algorithm, in which the steered response relates to peaks in the fourth order cross cumulant, rather than peaks in the second order cross cumulant, i.e. the cross power spectrum. Since speech sources have a Probability Density Function (PDF) close to the Laplacian distribution and noise are generally closer to the Gaussian distribution, the fourth order cumulant becomes a good alternative for the steered response search for speech sources. The proposed method is evaluated and compared to the original SRP-PHAT algorithm and shows significant improvements in localization performance for speech sources.