TY - GEN
T1 - How to catch l 2-heavy-hitters on sliding windows
AU - Braverman, Vladimir
AU - Gelles, Ran
AU - Ostrovsky, Rafail
PY - 2013
Y1 - 2013
N2 - Finding heavy-elements (heavy-hitters) in streaming data is one of the central, and well-understood tasks. Despite the importance of this problem, when considering the sliding windows model of streaming (where elements eventually expire) the problem of finding L 2-heavy elements has remained completely open despite multiple papers and considerable success in finding L 1-heavy elements. Since the L 2-heavy element problem doesn't satisfy certain conditions, existing methods for sliding windows algorithms, such as smooth histograms or exponential histograms are not directly applicable to it. In this paper, we develop the first polylogarithmic-memory algorithm for finding L 2-heavy elements in the sliding window model. Our technique allows us not only to find L 2-heavy elements, but also heavy elements with respect to any L p with 0 < p ≤ 2 on sliding windows. By this we completely close the gap and resolve the question of finding L p-heavy elements in the sliding window model with polylogarithmic memory, since it is well known that for p > 2 this task is impossible. We demonstrate a broader applicability of our method on two additional examples: we show how to obtain a sliding window approximation of the similarity of two streams, and of the fraction of elements that appear exactly a specified number of times within the window (the α-rarity problem). In these two illustrative examples of our method, we replace the current expected memory bounds with worst case bounds.
AB - Finding heavy-elements (heavy-hitters) in streaming data is one of the central, and well-understood tasks. Despite the importance of this problem, when considering the sliding windows model of streaming (where elements eventually expire) the problem of finding L 2-heavy elements has remained completely open despite multiple papers and considerable success in finding L 1-heavy elements. Since the L 2-heavy element problem doesn't satisfy certain conditions, existing methods for sliding windows algorithms, such as smooth histograms or exponential histograms are not directly applicable to it. In this paper, we develop the first polylogarithmic-memory algorithm for finding L 2-heavy elements in the sliding window model. Our technique allows us not only to find L 2-heavy elements, but also heavy elements with respect to any L p with 0 < p ≤ 2 on sliding windows. By this we completely close the gap and resolve the question of finding L p-heavy elements in the sliding window model with polylogarithmic memory, since it is well known that for p > 2 this task is impossible. We demonstrate a broader applicability of our method on two additional examples: we show how to obtain a sliding window approximation of the similarity of two streams, and of the fraction of elements that appear exactly a specified number of times within the window (the α-rarity problem). In these two illustrative examples of our method, we replace the current expected memory bounds with worst case bounds.
UR - http://www.scopus.com/inward/record.url?scp=84884301083&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-38768-5_56
DO - 10.1007/978-3-642-38768-5_56
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84884301083
SN - 9783642387678
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 638
EP - 650
BT - Computing and Combinatorics - 19th International Conference, COCOON 2013, Proceedings
T2 - 19th International Computing and Combinatorics Conference, COCOON 2013
Y2 - 21 June 2013 through 21 June 2013
ER -