TY - JOUR
T1 - Approximate matching in the L1 metric
AU - Amir, Amihood
AU - Lipsky, Ohad
AU - Porat, Ely
AU - Umanski, Julia
PY - 2005
Y1 - 2005
N2 - Approximate matching is one of the fundamental problems in pattern matching, and a ubiquitous problem in real applications. The Hamming distance is a simple and well-studied example of approximate matching, motivated by typing errors or noisy channels. Biological and image processing applications assign a different value to mismatches of different symbols. We consider the problem of approximate matching in the L1 metric - the k-L1-distance problem. Given text $T = t_0, \ldots, t_{n-1}$ and pattern $P = p_0, \ldots, p_{m-1}$, strings of natural numbers, and a natural number $k$, we seek all text locations $i$ where the L1 distance of the pattern from the length-$m$ substring of text starting at $i$ is not greater than $k$, i.e. $\sum_{j=0}^{m-1} |t_{i+j} - p_j| \le k$. We provide an algorithm that solves the k-L1-distance problem in time $O(n \sqrt{k} \log k)$. The algorithm applies a bounded divide-and-conquer approach and makes novel uses of non-boolean convolutions.
AB - Approximate matching is one of the fundamental problems in pattern matching, and a ubiquitous problem in real applications. The Hamming distance is a simple and well-studied example of approximate matching, motivated by typing errors or noisy channels. Biological and image processing applications assign a different value to mismatches of different symbols. We consider the problem of approximate matching in the L1 metric - the k-L1-distance problem. Given text $T = t_0, \ldots, t_{n-1}$ and pattern $P = p_0, \ldots, p_{m-1}$, strings of natural numbers, and a natural number $k$, we seek all text locations $i$ where the L1 distance of the pattern from the length-$m$ substring of text starting at $i$ is not greater than $k$, i.e. $\sum_{j=0}^{m-1} |t_{i+j} - p_j| \le k$. We provide an algorithm that solves the k-L1-distance problem in time $O(n \sqrt{k} \log k)$. The algorithm applies a bounded divide-and-conquer approach and makes novel uses of non-boolean convolutions.
UR - http://www.scopus.com/inward/record.url?scp=26444603465&partnerID=8YFLogxK
U2 - 10.1007/11496656_9
DO - 10.1007/11496656_9
M3 - Conference article
AN - SCOPUS:26444603465
SN - 0302-9743
VL - 3537
SP - 91
EP - 103
JO - Lecture Notes in Computer Science
JF - Lecture Notes in Computer Science
T2 - 16th Annual Symposium on Combinatorial Pattern Matching, CPM 2005
Y2 - 19 June 2005 through 22 June 2005
ER -