Linear Time Reconstruction of Parameterized Strings from Parameterized Suffix and LCP Arrays for Constant-Sized Alphabets

Amihood Amir, Eitan Kondratovsky, Shoshana Marcus, Dina Sokol

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A parameterized string (p-string) is a string that can contain two kinds of characters, static symbols and parameter characters. Parameterized pattern matching is a form of pattern matching that allows parameters to be renamed by applying a one-to-one function. The parameterized suffix array is a data structure that is useful in efficient parameterized pattern matching when accompanied by the parameterized longest common prefix (LCP) array. Reconstructing input from a given instance of a data structure is the task of determining whether the instance is valid or not, and if valid, producing a plausible set of data that it can represent. In this paper we consider parameterized suffix and LCP arrays and reconstruct a corresponding p-string that they can represent. In previous work, an algorithm can determine in O(n2) time whether a p-string can be constructed to correspond to the input parameterized suffix and LCP arrays of size n. In this work, we develop an algorithm that accomplishes this in O(n) time for constant-sized alphabets, and O(nlogn) time for general alphabets. Furthermore, when reconstruction is possible, we demonstrate that a p-string can be reconstructed over the minimal alphabet in O(n2) time.

Original languageEnglish
Title of host publicationString Processing and Information Retrieval - 31st International Symposium, SPIRE 2024, Proceedings
EditorsZsuzsanna Lipták, Edleno Moura, Karina Figueroa, Ricardo Baeza-Yates
PublisherSpringer Science and Business Media Deutschland GmbH
Pages1-15
Number of pages15
ISBN (Print)9783031721991
DOIs
StatePublished - 2025
Event31st International Symposium on String Processing and Information Retrieval, SPIRE 2024 - Puerto Vallarta, Mexico
Duration: 23 Sep 202425 Sep 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14899 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference31st International Symposium on String Processing and Information Retrieval, SPIRE 2024
Country/TerritoryMexico
CityPuerto Vallarta
Period23/09/2425/09/24

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords

  • Longest Common Prefix Array
  • Parameterized Strings
  • Reverse Engineering
  • Strings
  • Suffix Array

Fingerprint

Dive into the research topics of 'Linear Time Reconstruction of Parameterized Strings from Parameterized Suffix and LCP Arrays for Constant-Sized Alphabets'. Together they form a unique fingerprint.

Cite this