학술논문

Hybrid Packet Loss Concealment for Real-Time Networked Music Applications
Document Type
article
Source
IEEE Open Journal of Signal Processing, Vol 5, Pp 266-273 (2024)
Subject
Audio signal processing
autoregressive models
machine learning
networked music performance
neural networks
packet loss concealment
Electrical engineering. Electronics. Nuclear engineering
TK1-9971
Language
English
ISSN
2644-1322
Abstract
Real-time audio communications over IP have become essential to our daily lives. Packet-switched networks, however, are inherently prone to jitter and data losses, thus creating a strong need for effective packet loss concealment (PLC) techniques. Though solutions based on deep learning have made significant progress in that direction as far as speech is concerned, extending the use of such methods to applications of Networked Music Performance (NMP) presents significant challenges, including high fidelity requirements, higher sampling rates, and stringent temporal constraints associated to the simultaneous interaction between remote musicians. In this article, we present PARCnet, a hybrid PLC method that utilizes a feed-forward neural network to estimate the time-domain residual signal of a parallel linear autoregressive model. Objective metrics and a listening test show that PARCnet provides state-of-the-art results while enabling real-time operation on CPU.