학술논문

Convolutional recurrent neural networks for bird audio detection
Document Type
Conference
Source
2017 25th European Signal Processing Conference (EUSIPCO) Signal Processing Conference (EUSIPCO), 2017 25th European. :1744-1748 Aug, 2017
Subject
Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Robotics and Control Systems
Signal Processing and Analysis
Birds
Convolution
Feature extraction
Acoustics
Recurrent neural networks
Time-frequency analysis
Electronic mail
Language
ISSN
2076-1465
Abstract
Bird sounds possess distinctive spectral structure which may exhibit small shifts in spectrum depending on the bird species and environmental conditions. In this paper, we propose using convolutional recurrent neural networks on the task of automated bird audio detection in real-life environments. In the proposed method, convolutional layers extract high dimensional, local frequency shift invariant features, while recurrent layers capture longer term dependencies between the features extracted from short time frames. This method achieves 88.5% Area Under ROC Curve (AUC) score on the unseen evaluation data and obtains the second place in the Bird Audio Detection challenge.