Abstract
Although it is well-known that Huffman Codes are optimal for text compression in a character-per-chaxacter encoding scheme, they are seldom used in practical situations since they reqnire a bit-per-bit decoding algorithm, which has to be written is some assembly langnage, and will perform rather slowly. A number of methods are presented that avoid these difficulties. The decoding algorithms efficiently process the encoded string on a byte-per-byte basis, are faster than the original algorithm, and can be programmed in any high level langnage. This is achieved at the cost of storing some tables in the internal memory, but with no loss in the compression savings of the optimal Huffman codes. The internal memory space needed can be reduced either at the cost of increased processing time, or by using non-binary Huffman codes, which give sub-optimal compression. Experimental results for English and Hebrew text are also presented.
Original language | English |
---|---|
Title of host publication | Proceedings of the 8th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1985 |
Publisher | Association for Computing Machinery, Inc |
Pages | 122-130 |
Number of pages | 9 |
ISBN (Electronic) | 0897911598, 9780897911597 |
DOIs | |
State | Published - 5 Jun 1985 |
Event | 8th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1985 - Montreal, Canada Duration: 5 Jun 1985 → 7 Jun 1985 |
Publication series
Name | Proceedings of the 8th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1985 |
---|
Conference
Conference | 8th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1985 |
---|---|
Country/Territory | Canada |
City | Montreal |
Period | 5/06/85 → 7/06/85 |
Bibliographical note
Publisher Copyright:© Proceedings of the 1985 Annual ACM SIGAda International Conference on Ada, SIGAda 1985. All rights reserved.