Abstract
It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we show here that arithmetic coding may produce an output that is identical to that of Huffman.
Original language | English |
---|---|
Title of host publication | Proceedings - DCC 2019 |
Subtitle of host publication | 2019 Data Compression Conference |
Editors | James A. Storer, Joan Serra-Sagrista, Ali Bilgin, Michael W. Marcellin |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 581 |
Number of pages | 1 |
ISBN (Electronic) | 9781728106571 |
DOIs | |
State | Published - 10 May 2019 |
Event | 2019 Data Compression Conference, DCC 2019 - Snowbird, United States Duration: 26 Mar 2019 → 29 Mar 2019 |
Publication series
Name | Data Compression Conference Proceedings |
---|---|
Volume | 2019-March |
ISSN (Print) | 1068-0314 |
Conference
Conference | 2019 Data Compression Conference, DCC 2019 |
---|---|
Country/Territory | United States |
City | Snowbird |
Period | 26/03/19 → 29/03/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- Kullback Leibler
- Lossless Compression
- Randomness