How to catch L2-heavy-hitters on sliding windows

Vladimir Braverman, Ran Gelles, Rafail Ostrovsky

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Finding heavy-elements (heavy-hitters) in streaming data is one of the central, and well-understood tasks. Despite the importance of this problem, when considering the sliding windows model of streaming (where elements eventually expire) the problem of finding L2-heavy elements has remained completely open despite multiple papers and considerable success in finding L1-heavy elements. Since the L2-heavy element problem doesn't satisfy certain conditions, existing methods for sliding windows algorithms, such as smooth histograms or exponential histograms are not directly applicable to it. In this paper, we develop the first polylogarithmic-memory algorithm for finding L2-heavy elements in the sliding window model. Our technique allows us not only to find L2-heavy elements, but also heavy elements with respect to any Lp with 0<p≤2 on sliding windows. By this we completely "close the gap" and resolve the question of finding Lp-heavy elements in the sliding window model with polylogarithmic memory, since it is well known that for p>2 this task is impossible. We demonstrate a broader applicability of our method on two additional examples: we show how to obtain a sliding window approximation of the similarity of two streams, and of the fraction of elements that appear exactly a specified number of times within the window (the α-rarity problem). In these two illustrative examples of our method, we replace the current expected memory bounds with worst case bounds.

Original languageEnglish
Pages (from-to)82-94
Number of pages13
JournalTheoretical Computer Science
Volume554
Issue numberC
DOIs
StatePublished - 2014
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2014 Elsevier B.V.

Funding

V.B. is supported in part by DARPA grant N660001-1-2-4014 . R.O. is supported in part by NSF grants CNS-0830803 ; CCF-0916574 ; IIS-1065276 ; CCF-1016540 ; CNS-1118126 ; CNS-1136174 ; US-Israel BSF grant 2008411 , OKAWA Foundation Research Award , IBM Faculty Research Award , Xerox Faculty Research Award , B. John Garrick Foundation Award , Teradata Research Award , and Lockheed-Martin Corporation Research Award . This material is also based upon work supported by the Defense Advanced Research Projects Agency through the US Office of Naval Research under Contract N00014-11-1-0392 . The views expressed are those of the author and do not reflect the official policy or position of the Department of Defense or the US Government.

FundersFunder number
US-Israel BSF2008411
National Science FoundationCNS-1136174, IIS-1065276, CCF-1016540, CNS-1118126, CCF-0916574, CNS-0830803
Office of Naval ResearchN00014-11-1-0392
Directorate for Computer and Information Science and Engineering0716389, 0830803, 0716835, 0916574
Defense Advanced Research Projects AgencyN660001-1-2-4014
International Business Machines Corporation

    Keywords

    • Approximation algorithms
    • Data streams
    • Heavy hitters
    • Sliding window model

    Fingerprint

    Dive into the research topics of 'How to catch L2-heavy-hitters on sliding windows'. Together they form a unique fingerprint.

    Cite this