Best wavelet-packet bases for audio coding using perceptual and rate-distortion criteria

Markus Erne, George Moschytz, Christof Faller

Research output: Contribution to journalConference articlepeer-review

14 Scopus citations

Abstract

This paper presents a new approach to the adaptation of a wavelet filterbank based on perceptual and rate-distortion criteria. The system makes use of a wavelet-packet transform where each subband can have an individual time-segmentation. Boundary effects can be avoided by using overlapping blocks of samples and therefore switching bases is possible at every tree-level without affecting other subbands. A modified psychoacoustic model using perceptual entropy can control the switching of the wavelet filterbank and the individual time-segmentation of every subband allows to take advantage of temporal masking. Additionally a rate-distortion measure can control the filterbank for lossless audio coding applications or in cases where large coding gains can be achieved without using perceptual criteria. The weight of the perceptual measure as well as the weight of the rate-distortion measure can be selected individually, enabling to trade lossless coding versus perceptual coding.

Original languageEnglish
Pages (from-to)909-912
Number of pages4
JournalProceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
Volume2
DOIs
StatePublished - 1999
Externally publishedYes
EventProceedings of the 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-99) - Phoenix, AZ, USA
Duration: 15 Mar 199919 Mar 1999

Fingerprint

Dive into the research topics of 'Best wavelet-packet bases for audio coding using perceptual and rate-distortion criteria'. Together they form a unique fingerprint.

Cite this