TY - JOUR

T1 - The frequent items problem, under polynomial decay, in the streaming model

AU - Feigenblat, Guy

AU - Itzhaki, Ofra

AU - Porat, Ely

PY - 2010/7/17

Y1 - 2010/7/17

N2 - We consider the problem of estimating the frequency count of data stream elements under polynomial decay functions. In these settings every element in the stream is assigned with a time-decreasing weight, using a non-increasing polynomial function. Decay functions are used in applications where older data is less significant, less interesting or even less reliable than recent data. Consider a data stream of N elements drawn from a universe U. We propose three poly-logarithmic algorithms for the problem. The first one, deterministic, uses O(1/2logN(loglogN+logU)) bits, where ∈(0,1) is the approximation parameter. The second one, probabilistic, uses O(12logNδlog1) bits or O(12logNδlogN) bits, depending on the decay function parameter, where δ∈(0,1) is the probability of failure. The third one, deterministic in the stochastic model, uses O(1logU) bits or O(12logN) bits, also depending on the decay parameter as will be described in this paper. This variant of the problem is important and has many applications. To our knowledge, it has never been studied before.

AB - We consider the problem of estimating the frequency count of data stream elements under polynomial decay functions. In these settings every element in the stream is assigned with a time-decreasing weight, using a non-increasing polynomial function. Decay functions are used in applications where older data is less significant, less interesting or even less reliable than recent data. Consider a data stream of N elements drawn from a universe U. We propose three poly-logarithmic algorithms for the problem. The first one, deterministic, uses O(1/2logN(loglogN+logU)) bits, where ∈(0,1) is the approximation parameter. The second one, probabilistic, uses O(12logNδlog1) bits or O(12logNδlogN) bits, depending on the decay function parameter, where δ∈(0,1) is the probability of failure. The third one, deterministic in the stochastic model, uses O(1logU) bits or O(12logN) bits, also depending on the decay parameter as will be described in this paper. This variant of the problem is important and has many applications. To our knowledge, it has never been studied before.

KW - Algorithms

KW - Frequency count

KW - Polynomial decay functions

KW - Streaming

UR - http://www.scopus.com/inward/record.url?scp=77955421441&partnerID=8YFLogxK

U2 - 10.1016/j.tcs.2010.04.029

DO - 10.1016/j.tcs.2010.04.029

M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???

AN - SCOPUS:77955421441

SN - 0304-3975

VL - 411

SP - 3048

EP - 3054

JO - Theoretical Computer Science

JF - Theoretical Computer Science

IS - 34-36

ER -