Improved techniques for processing queries in full-text systems

Y. Choueka, A. S. Fraenkel, S. T. Klein, E. Segal

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

24 Scopus citations

Abstract

In static full-text retrieval systems, which accommodate metrical as well as Boolean operators, the traditional approach to query processing uses a "concordance", from which large sets of coordinates are retrieved and then merged and/or collated. Alternatively, in a system with £ documents, the concordance can be replaced by a set of bit-maps of fixed length £, which are constructed for every different word of the database and serve as occurrence maps. We propose to combine the concordance and bit-map approaches, and show how this can speed up the processing of queries: fast ANDing and ORing of the maps in a preprocessing stage, lead to large I/O savings in collating coordinates of keywords needed to satisfy the metrical and Boolean constraints. Moreover, the bit-maps give partial information on the distribution of the coordinates of the keywords, which can be used when queries must be processed by stages, due to their complexity and the sizes of the involved sets of coordinates. The new techniques are partially implemented at the Responsa Retrieval Project.

Original languageEnglish
Title of host publicationProceedings of the 10th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1987
EditorsC.T. Yu, C. Van Rijsbergen
PublisherAssociation for Computing Machinery, Inc
Pages306-315
Number of pages10
ISBN (Electronic)0897912322, 9780897912327
DOIs
StatePublished - 1 Nov 1987
Event10th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1987 - New Orleans, United States
Duration: 3 Jun 19875 Jun 1987

Publication series

NameProceedings of the 10th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1987

Conference

Conference10th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1987
Country/TerritoryUnited States
CityNew Orleans
Period3/06/875/06/87

Bibliographical note

Publisher Copyright:
© 1987 ACM.

Fingerprint

Dive into the research topics of 'Improved techniques for processing queries in full-text systems'. Together they form a unique fingerprint.

Cite this