The string-to-dictionary matching problem

Shmuel T. Klein, Dana Shapira

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

The String-to-Dictionary Matching Problem is defined, in which a string is searched for in all the possible concatenations of the elements of a given dictionary, with applications to compressed matching in variable to fixed length encodings, such as Tunstall's. An algorithm based on suffix trees is suggested and experiments on natural language text are presented suggesting that compressed search might use less comparisons for long enough patterns, in spite of a potentially large number of encodings.

Original languageEnglish
Title of host publicationProceedings - DCC 2011
Subtitle of host publication2011 Data Compression Conference
Pages143-152
Number of pages10
DOIs
StatePublished - 2011
Event2011 Data Compression Conference, DCC 2011 - Snowbird, UT, United States
Duration: 29 Mar 201131 Mar 2011

Publication series

NameData Compression Conference Proceedings
ISSN (Print)1068-0314

Conference

Conference2011 Data Compression Conference, DCC 2011
Country/TerritoryUnited States
CitySnowbird, UT
Period29/03/1131/03/11

Bibliographical note

Place of conference:USA

Keywords

  • Compressed Matching
  • Suffix Trees
  • Tunstall

Fingerprint

Dive into the research topics of 'The string-to-dictionary matching problem'. Together they form a unique fingerprint.

Cite this