Abstract
For a given text which has been encoded by a static Huffman code, the possibility of locating a given pattern directly in the compressed text is investigated. The main problem is one of synchronization, as an occurrence of the encoded pattern in the encoded text does not necessarily correspond to an occurrence of the pattern in the text. A simple algorithm is suggested which reduces the number of erroneously declared matches. The probability of such false matches is analyzed and empirically tested.
Original language | English |
---|---|
Pages (from-to) | 829-841 |
Number of pages | 13 |
Journal | Information Processing and Management |
Volume | 41 |
Issue number | 4 |
DOIs | |
State | Published - Jul 2005 |
Keywords
- Compressed matching
- Data compression
- Huffman codes
- Pattern matching