Academic Paper

Deep Autotuner: A Pitch Correcting Network for Singing Performances
Document Type
Conference
Source
ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 246-250, May 2020
Subject
Signal Processing and Analysis
Tracking
Neural networks
Music
Multiple signal classification
Task analysis
Speech processing
Spectrogram
music information retrieval
singing voice
automatic pitch correction
deep learning
autotuning
Language
English
ISSN
2379-190X
Abstract
We introduce a data-driven approach to automatic pitch correction of solo singing performances. The proposed approach predicts note-wise pitch shifts from the relationship between the respective spectrograms of the singing and accompaniment. This approach differs from commercial systems, where vocal track notes are usually shifted to be centered around pitches in a user-defined score, or mapped to the closest pitch among the twelve equal-tempered scale degrees. The proposed system treats pitch as a continuous value rather than relying on a set of discretized notes found in musical scores, thus allowing for improvisation and harmonization in the singing performance. We train our neural network model using a dataset of 4,702 amateur karaoke performances selected for good intonation. Our model is trained on both incorrect intonation, for which it learns a correction, and intentional pitch variation, which it learns to preserve. The proposed deep neural network with gated recurrent units on top of convolutional layers shows promising performance on the real-world score-free singing pitch correction task—autotuning.
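The abstract describes a network that stacks gated recurrent units on top of convolutional layers and predicts a continuous, note-wise pitch shift from the paired vocal and accompaniment spectrograms. The following is a minimal PyTorch sketch of that kind of architecture, not the authors' released code; all layer sizes, names, and the per-frame output head are illustrative assumptions.

# Illustrative sketch of a CNN + GRU pitch-shift predictor operating on the
# vocal and backing-track spectrograms (assumed shapes and sizes, not the
# paper's exact configuration).
import torch
import torch.nn as nn


class PitchShiftPredictor(nn.Module):
    def __init__(self, n_bins=128, hidden=64):
        super().__init__()
        # Two input channels: vocal spectrogram and accompaniment spectrogram.
        self.conv = nn.Sequential(
            nn.Conv2d(2, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d((2, 1)),                        # pool over frequency only
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d((2, 1)),
        )
        conv_features = 32 * (n_bins // 4)
        # GRU runs over the time axis of the convolutional feature map.
        self.gru = nn.GRU(conv_features, hidden, batch_first=True)
        # One continuous (unquantized) shift per time step; note-wise shifts
        # could then be obtained by pooling over each note's frames.
        self.head = nn.Linear(hidden, 1)

    def forward(self, vocal_spec, backing_spec):
        # vocal_spec, backing_spec: (batch, n_bins, n_frames)
        x = torch.stack([vocal_spec, backing_spec], dim=1)  # (B, 2, F, T)
        x = self.conv(x)                                     # (B, C, F', T)
        b, c, f, t = x.shape
        x = x.permute(0, 3, 1, 2).reshape(b, t, c * f)       # (B, T, C*F')
        out, _ = self.gru(x)
        return self.head(out).squeeze(-1)                    # (B, T) shifts


if __name__ == "__main__":
    model = PitchShiftPredictor()
    vocal = torch.randn(1, 128, 200)
    backing = torch.randn(1, 128, 200)
    print(model(vocal, backing).shape)  # torch.Size([1, 200])

Because the output is a real-valued shift rather than a class over the twelve equal-tempered scale degrees, a design like this can preserve intentional pitch variation such as improvisation or harmonization, which is the behavior the abstract highlights.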