학술논문

Dynamic-Structured Reservoir Spiking Neural Network in Sound Localization
Document Type
Periodical
Source
IEEE Access Access, IEEE. 12:24596-24608 2024
Subject
Aerospace
Bioengineering
Communication, Networking and Broadcast Technologies
Components, Circuits, Devices and Systems
Computing and Processing
Engineered Materials, Dielectrics and Plasmas
Engineering Profession
Fields, Waves and Electromagnetics
General Topics for Engineers
Geoscience
Nuclear Engineering
Photonics and Electrooptics
Power, Energy and Industry Applications
Robotics and Control Systems
Signal Processing and Analysis
Transportation
Location awareness
Neurons
Computer architecture
Encoding
Biological neural networks
Recurrent neural networks
Heuristic algorithms
Acoustics
Resonant frequency
Sound localization
spiking neural network
dynamic structure
ITD
energy-based method
adaptive resonance theory
Language
ISSN
2169-3536
Abstract
Sound source localization is a critical problem in various fields, including communication, security, and entertainment. Binaural cues are a natural technique used by mammalian ears for efficient sound source localization. Spiking neural networks (SNNs) have emerged as a promising tool for implementing binaural sound source localization approaches. However, optimizing the topology and size of SNNs is crucial to reduce computational costs while maintaining accuracy. This paper proposes a real-time structure of a reservoir SNN (rSNN) called Adaptive-Resonance-Theory-based rSNN (ART-rSNN) for localizing sound sources in the time domain by integrating an energy-based localization method. The dataset used in this work is recorded by two different omnidirectional microphones from a real environment. The dataset includes various sound events such as speech, music, and environmental sounds. The proposed ART-rSNN architecture can dynamically adjust the location of its neurons to amplify estimated energy near the sound source, resulting in higher localization accuracy. Our proposed method outperforms several conventional and state of the art algorithms in terms of accuracy and is able to detect the front and back direction of azimuth angle. This work demonstrates the potential of dynamic neuron arrangements in SNNs for improving sound source localization in practical applications.