Approximate matching in the L 1 metric

A. Amihood, O Lipsky, E Porat, J Umanski

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Approximate matching is one of the fundamental problems in pattern matching, and a ubiquitous problem in real applications. The Hamming distance is a simple and well studied example of approximate matching, motivated by typing, or noisy channels. Biological and image processing applications assign a different value to mismatches of different symbols. We consider the problem of approximate matching in the L 1 metric – the k- L 1 -distance problem. Given text T=t 0,...,t n − 1 and pattern P=p 0,...,p m − 1 strings of natural number, and a natural number k, we seek all text locations i where the L 1 distance of the pattern from the length m substring of text starting at i is not greater than k, i.e. ∑m−1j=0|ti+j−pj|≤k∑j=0m−1|ti+j−pj|≤k. We provide an algorithm that solves the k-L 1-distance problem in time O(nklogk−−−−−√)O(nklog⁡k). The algorithm applies a bounded divide-and-conquer approach and makes novel uses of non-boolean convolutions.
Original languageAmerican English
Title of host publicationAnnual Symposium on Combinatorial Pattern Matching
EditorsAlberto Apostolico, Maxime Crochemore, Kunsoo Park
PublisherSpringer Berlin Heidelberg
StatePublished - 2005

Bibliographical note

Place of conference:Korea

Fingerprint

Dive into the research topics of 'Approximate matching in the L 1 metric'. Together they form a unique fingerprint.

Cite this