Efficient computations of ℓ1 and ℓ rearrangement distances

Amihood Amir, Yonatan Aumann, Piotr Indyk, Avivit Levy, Ely Porat

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

13 Scopus citations

Abstract

Recently, a new pattern matching paradigm was proposed, pattern matching with address errors. In this paradigm approximate string matching problems are studied, where the content is unaltered and only the locations of the different entries may change. Specifically, a broad class of problems in this new paradigm was defined - the class of rearrangement errors. In this type of errors the pattern is transformed through a sequence of rearrangement operations, each with an associated cost. The natural ℓ1 and ℓ2 rearrangement systems were considered. A variant of the ℓ1- rearrangement distance problem seems more difficult - where the pattern is a general string that may have repeating symbols. The best algorithm presented for the general case is O(nm). In this paper, we show that even for general strings the problem can be approximated in linear time! This paper also considers another natural rearrangement system - the ℓ rearrangement distance. For this new rearrangement system we provide efficient exact solutions for different variants of the problem, as well as a faster approximation.

Original languageEnglish
Title of host publicationString Processing and Information Retrieval - 14th International Symposium, SPIRE 2007, Proceedings
PublisherSpringer Verlag
Pages39-49
Number of pages11
ISBN (Print)9783540755296
DOIs
StatePublished - 2007
Event14th International Symposium on String Processing and Information Retrieval, SPIRE 2007 - Santiago, Chile
Duration: 29 Oct 200731 Oct 2007

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4726 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Symposium on String Processing and Information Retrieval, SPIRE 2007
Country/TerritoryChile
CitySantiago
Period29/10/0731/10/07

Bibliographical note

Funding Information:
The authors wish to thank the anonymous referees who carefully read the original manuscript and provided comments that were invaluable to the flow and understandability of the paper. First author was partly supported by ISF grant 35/05.

Funding

The authors wish to thank the anonymous referees who carefully read the original manuscript and provided comments that were invaluable to the flow and understandability of the paper. First author was partly supported by ISF grant 35/05.

FundersFunder number
Iowa Science Foundation35/05

    Fingerprint

    Dive into the research topics of 'Efficient computations of ℓ1 and ℓ rearrangement distances'. Together they form a unique fingerprint.

    Cite this