On Suffix Tree Detection

Amihood Amir, Eitan Kondratovsky, Avivit Levy

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

A suffix tree is a fundamental data structure for string processing and information retrieval, however, its structure is still not well understood. The suffix trees reverse engineering problem, which its research aims at reducing this gap, is the following. Given an ordered rooted tree T with unlabeled edges, determine whether there exists a string w such that the unlabeled-edges suffix tree of w is isomorphic to T. Previous studies on this problem consider the relaxation of having the suffix links as well as assume a binary alphabet. This paper is the first to consider the suffix tree detection problem, in which the relaxation of having suffix links as input is removed. We study suffix tree detection on two scenarios that are interesting per se. We provide a suffix tree detection algorithm for general alphabet periodic strings. Given an ordered tree T with n leaves, our detection algorithm takes $$O(n+|\varSigma |^p)$$ -time, where p is the unknown in advance length of a period that repeats at least 3 times in a string S having a suffix tree structure identical to T, if such S exists. Therefore, it is a polynomial time algorithm if p is a constant and a linear time algorithm if, in addition, the alphabet has a sub-linear size. We also show some necessary (but insufficient) conditions for binary alphabet general strings suffix tree detection. By this we take another step towards understanding suffix trees structure.

Original languageEnglish
Title of host publicationString Processing and Information Retrieval - 30th International Symposium, SPIRE 2023, Proceedings
EditorsFranco Maria Nardini, Nadia Pisanti, Rossano Venturini
PublisherSpringer Science and Business Media Deutschland GmbH
Pages14-27
Number of pages14
ISBN (Print)9783031439797
DOIs
StatePublished - 2023
Event30th International Symposium on String Processing and Information Retrieval, SPIRE 2023 - Pisa, Italy
Duration: 26 Sep 202328 Sep 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14240 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference30th International Symposium on String Processing and Information Retrieval, SPIRE 2023
Country/TerritoryItaly
CityPisa
Period26/09/2328/09/23

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.

Funding

Partly supported by ISF grant 1475/18 and BSF grant 2018141.

FundersFunder number
United States-Israel Binational Science Foundation2018141
Israel Science Foundation1475/18

    Keywords

    • Periodic string
    • Reverse engineering
    • Suffix tree
    • Suffix tree detection

    Fingerprint

    Dive into the research topics of 'On Suffix Tree Detection'. Together they form a unique fingerprint.

    Cite this