Abstract
Recently, a new pattern matching paradigm was proposed, pattern matching with address errors. In this paradigm approximate string matching problems are studied, where the content is unaltered and only the locations of the different entries may change. Specifically, a broad class of problems in this new paradigm was defined - the class of rearrangement errors. In this type of errors the pattern is transformed through a sequence of rearrangement operations, each with an associated cost. The natural ℓ1 and ℓ2 rearrangement systems were considered. A variant of the ℓ1- rearrangement distance problem seems more difficult - where the pattern is a general string that may have repeating symbols. The best algorithm presented for the general case is O(nm). In this paper, we show that even for general strings the problem can be approximated in linear time! This paper also considers another natural rearrangement system - the ℓ∞ rearrangement distance. For this new rearrangement system we provide efficient exact solutions for different variants of the problem, as well as a faster approximation.
Original language | English |
---|---|
Title of host publication | String Processing and Information Retrieval - 14th International Symposium, SPIRE 2007, Proceedings |
Publisher | Springer Verlag |
Pages | 39-49 |
Number of pages | 11 |
ISBN (Print) | 9783540755296 |
DOIs | |
State | Published - 2007 |
Event | 14th International Symposium on String Processing and Information Retrieval, SPIRE 2007 - Santiago, Chile Duration: 29 Oct 2007 → 31 Oct 2007 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 4726 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 14th International Symposium on String Processing and Information Retrieval, SPIRE 2007 |
---|---|
Country/Territory | Chile |
City | Santiago |
Period | 29/10/07 → 31/10/07 |
Bibliographical note
Funding Information:The authors wish to thank the anonymous referees who carefully read the original manuscript and provided comments that were invaluable to the flow and understandability of the paper. First author was partly supported by ISF grant 35/05.
Funding
The authors wish to thank the anonymous referees who carefully read the original manuscript and provided comments that were invaluable to the flow and understandability of the paper. First author was partly supported by ISF grant 35/05.
Funders | Funder number |
---|---|
Iowa Science Foundation | 35/05 |