TY - JOUR
T1 - The frequent items problem, under polynomial decay, in the streaming model
AU - Feigenblat, Guy
AU - Itzhaki, Ofra
AU - Porat, Ely
PY - 2010/7/17
Y1 - 2010/7/17
N2 - We consider the problem of estimating the frequency count of data stream elements under polynomial decay functions. In these settings every element in the stream is assigned with a time-decreasing weight, using a non-increasing polynomial function. Decay functions are used in applications where older data is less significant, less interesting or even less reliable than recent data. Consider a data stream of N elements drawn from a universe U. We propose three poly-logarithmic algorithms for the problem. The first one, deterministic, uses O(1/2logN(loglogN+logU)) bits, where ∈(0,1) is the approximation parameter. The second one, probabilistic, uses O(12logNδlog1) bits or O(12logNδlogN) bits, depending on the decay function parameter, where δ∈(0,1) is the probability of failure. The third one, deterministic in the stochastic model, uses O(1logU) bits or O(12logN) bits, also depending on the decay parameter as will be described in this paper. This variant of the problem is important and has many applications. To our knowledge, it has never been studied before.
AB - We consider the problem of estimating the frequency count of data stream elements under polynomial decay functions. In these settings every element in the stream is assigned with a time-decreasing weight, using a non-increasing polynomial function. Decay functions are used in applications where older data is less significant, less interesting or even less reliable than recent data. Consider a data stream of N elements drawn from a universe U. We propose three poly-logarithmic algorithms for the problem. The first one, deterministic, uses O(1/2logN(loglogN+logU)) bits, where ∈(0,1) is the approximation parameter. The second one, probabilistic, uses O(12logNδlog1) bits or O(12logNδlogN) bits, depending on the decay function parameter, where δ∈(0,1) is the probability of failure. The third one, deterministic in the stochastic model, uses O(1logU) bits or O(12logN) bits, also depending on the decay parameter as will be described in this paper. This variant of the problem is important and has many applications. To our knowledge, it has never been studied before.
KW - Algorithms
KW - Frequency count
KW - Polynomial decay functions
KW - Streaming
UR - http://www.scopus.com/inward/record.url?scp=77955421441&partnerID=8YFLogxK
U2 - 10.1016/j.tcs.2010.04.029
DO - 10.1016/j.tcs.2010.04.029
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:77955421441
SN - 0304-3975
VL - 411
SP - 3048
EP - 3054
JO - Theoretical Computer Science
JF - Theoretical Computer Science
IS - 34-36
ER -