Academic Paper

Efficient transform coding of two-channel audio signals by means of complex-valued stereo prediction
Document Type
Conference
Source
2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 497-500, May 2011
Subject
Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Computing and Processing
Decoding
Transforms
Complexity theory
Audio coding
Transform coding
Compaction
MDCT
M/S stereo
prediction
Language
ISSN
1520-6149
2379-190X
Abstract
Traditional MDCT-based perceptual audio coding schemes employ mid/side and intensity stereo techniques to allow efficient joint coding of the two channels of a stereophonic signal. These techniques, however, provide little coding gain for critical stereo signals characterized by spectral components with a distinct level or phase difference between the channels. To overcome this deficiency, we propose an extension to the mid/side coding paradigm that utilizes complex-valued inter-channel linear prediction in the MDCT spectral domain. The required imaginary spectrum (MDST) is calculated in a computationally efficient manner without additional algorithmic delay. A formal listening test conducted in the course of the ISO/MPEG standardization of the Unified Speech and Audio Coding (USAC) codec illustrates that the proposed stereo prediction approach provides significant improvements in coding efficiency and shows that at 96 kb/s, excellent quality can be obtained even for critical signals.
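
The prediction idea described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names (`predict_coefficients`, `encode_band`, `decode_band`), the downmix scaling of 1/2, and the per-band least-squares fit are all assumptions made for illustration. In particular, the imaginary (MDST) spectrum of the mid channel is simply passed in as `mid_im`, whereas the paper computes it efficiently from the MDCT data; quantization and band partitioning are omitted.

```python
def predict_coefficients(mid_re, mid_im, side):
    """Least-squares fit of side ~ a_re * mid_re + a_im * mid_im for one band.

    mid_re: MDCT coefficients of the mid (downmix) channel.
    mid_im: MDST coefficients of the mid channel (assumed to be given).
    side:   MDCT coefficients of the side channel.
    """
    # Entries of the 2x2 normal equations.
    rr = sum(x * x for x in mid_re)
    ii = sum(y * y for y in mid_im)
    ri = sum(x * y for x, y in zip(mid_re, mid_im))
    rs = sum(x * s for x, s in zip(mid_re, side))
    i_s = sum(y * s for y, s in zip(mid_im, side))

    det = rr * ii - ri * ri
    if abs(det) < 1e-12:
        # Degenerate case: fall back to real-valued prediction only.
        return (rs / rr if rr > 0.0 else 0.0), 0.0

    a_re = (rs * ii - i_s * ri) / det
    a_im = (i_s * rr - rs * ri) / det
    return a_re, a_im


def encode_band(left, right, mid_im):
    """Form mid/side, predict side from the complex mid spectrum, keep the residual."""
    mid = [(l + r) / 2.0 for l, r in zip(left, right)]
    side = [(l - r) / 2.0 for l, r in zip(left, right)]
    a_re, a_im = predict_coefficients(mid, mid_im, side)
    residual = [s - a_re * m - a_im * mi for s, m, mi in zip(side, mid, mid_im)]
    return mid, residual, a_re, a_im


def decode_band(mid, residual, mid_im, a_re, a_im):
    """Invert encode_band: rebuild side from the residual, then left and right."""
    side = [e + a_re * m + a_im * mi for e, m, mi in zip(residual, mid, mid_im)]
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right
```

When the side spectrum is well explained by a complex-valued multiple of the mid spectrum, which corresponds to a level or phase difference between the channels, the residual carries little energy and only the mid channel, the residual, and the two coefficients would need to be coded.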