Generative models for bitmap sets with compression applications

Abraham Bookstein, Shmuel T. Klein

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

In large IR systems, information about word occurrence may be stored as a bit matrix, with rows corresponding to different words and columns to documents. Such a matrix is generally very large and very sparse. New methods for compressing such matrices are presented, which exploit possible correlations between rows and between columns. The methods are based on partitioning the matrix into small blocks and predicting the 1-bit distribution within a block by means of various bit generation models. Each block is then encoded using Huffman or arithmetic coding. Preliminary experimental results indicate improvements over previous methods.

Original languageEnglish
Title of host publicationProceedings of the 14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1991
PublisherAssociation for Computing Machinery, Inc
Pages63-71
Number of pages9
ISBN (Print)0897914481, 9780897914482
DOIs
StatePublished - 1 Sep 1991
Event14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1991 - Chicago, United States
Duration: 13 Oct 199116 Oct 1991

Publication series

NameProceedings of the 14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1991

Conference

Conference14th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1991
Country/TerritoryUnited States
CityChicago
Period13/10/9116/10/91

Bibliographical note

Publisher Copyright:
© 1991 ACM. All rights reserved.

Fingerprint

Dive into the research topics of 'Generative models for bitmap sets with compression applications'. Together they form a unique fingerprint.

Cite this