학술논문

Envelope-Based Multichannel Noise Reduction for Cochlear Implant Applications
Document Type
Periodical
Source
IEEE/ACM Transactions on Audio, Speech, and Language Processing IEEE/ACM Trans. Audio Speech Lang. Process. Audio, Speech, and Language Processing, IEEE/ACM Transactions on. 32:1873-1884 2024
Subject
Signal Processing and Analysis
Computing and Processing
Communication, Networking and Broadcast Technologies
General Topics for Engineers
Noise reduction
Ear
Acoustics
Array signal processing
Time-domain analysis
Maximum likelihood detection
Hearing aids
Beamforming
cochlear implants
microphone array
multichannel filter
noise reduction
speech processing
Language
ISSN
2329-9290
2329-9304
Abstract
Cochlear implants (CI) are implantable electronic prostheses that aim to restore communication in people with severe to profound hearing loss. This is achieved by transforming the acoustic signals into electrical stimuli and applying them directly to the cochlea through a set of electrodes. Despite its effectiveness under optimal environmental conditions, its performance is severely degraded in the presence of noise. This work proposes an envelope-based minimum-variance distortionless response (MVDR) approach for multichannel noise reduction in CI applications. The original constraint in the conventional time-domain MVDR beamformer is modified to ensure that the second-order statistical moments of the processed and original speech-only envelopes are equivalent. Assuming strict computational limitations, as those found in commercial multi-electrode systems, a low computational-cost particular case of the proposed method results in a power-based MVDR that has a semi-analytical solution. The resulting algorithm provides a fast calculation method for its optimal coefficients without losing significant performance compared to its general form. Results obtained from computational simulations and objective performance criteria indicate higher intelligibility levels achieved by the proposed methods compared to the conventional time-domain MVDR beamformer. This evidence is corroborated by psychoacoustic experiments with ten normal-hearing volunteers listening to vocoded speech, and one CI user employing a research interface.