Abstract
We revisit Nisan's classical pseudorandom generator (PRG) for space-bounded computation (STOC 1990) and its applications in streaming algorithms. We describe a new generator, HashPRG, that can be thought of as a symmetric version of Nisan's generator over larger alphabets. Our generator allows a trade-off between seed length and the time needed to compute a given block of the generator's output. HashPRG can be used to obtain derandomizations with much better update time and without sacrificing space for a large number of data stream algorithms, for example:•Andoni's F_p estimation algorithm for constant p > 2 (ICASSP, 2017) assumes a random oracle, but achieves optimal space and constant update time. Using HashPRG's time-space trade-off we eliminate the random oracle assumption while preserving the other properties. Previously no time-optimal derandomization was known. Using similar techniques, we give an algorithm for a relaxed version of ℓ_p sampling in a turnstile stream. Both of our algorithms use Õ(d1-2/p) bits of space and have O(1) update time.•For 0< p<2, the 1 pm ϵ approximate F_p estimation algorithm of Kane et al., (STOC, 2011) uses an optimal O(ϵ-2 log d) bits of space but has an update time of O(log 2(1/ϵ) log log (1/ϵ)). Using HashPRG, we show that if 1/√d ≤ ϵ ≤ 1/dc for an arbitrarily small constant c > 0, then we can obtain a 1 pm ϵ approximate F_p estimation algorithm that uses the optimal O(ϵ-2 log d) bits of space and has an update time of O(log d) in the Word RAM model, which is more than a quadratic improvement in the update time. We obtain similar improvements for entropy estimation.•CountSketch, with the fine-grained error analysis of Minton and Price (SODA, 2014). For derandomization, they suggested a direct application of Nisan's generator, yielding a logarithmic multiplicative space overhead. With HashPRG we obtain an efficient derandomization yielding the same asymptotic space as when assuming a random oracle. Our ability to obtain a time-efficient derandomization makes crucial use of HashPRG's symmetry. We also give the first derandomization of a recent private version of CountSketch.For a d-dimensional vector x being updated in a turnstile stream, we show that ||x||∞ can be estimated up to an additive error of ϵ||x||_2 using O(ϵ-2 log (1/ϵ) log d) bits of space. Additionally, the update time of this algorithm is O(log 1/ϵ) in the Word RAM model. We show that the space complexity of this algorithm is optimal up to constant factors. However, for vectors x with ||x||∞=Θ(||x||_2), we show that the lower bound can be broken by giving an algorithm that uses O(ϵ-2 log d) bits of space which approximates ||x||∞ up to an additive error of ϵ||x||_2. We use our aforementioned derandomization of the CountSketch data structure to obtain this algorithm, and using the time-space trade off of HashPRG, we show that the update time of this algorithm is also O(log 1/ϵ) in the Word RAM model.
Original language | English |
---|---|
Title of host publication | Proceedings - 2023 IEEE 64th Annual Symposium on Foundations of Computer Science, FOCS 2023 |
Publisher | IEEE Computer Society Press |
Publication date | 2023 |
Pages | 1515-1550 |
ISBN (Electronic) | 9798350318944 |
DOIs | |
Publication status | Published - 2023 |
Event | 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023 - Santa Cruz, United States Duration: 6 Nov 2023 → 9 Nov 2023 |
Conference
Conference | 64th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2023 |
---|---|
Country/Territory | United States |
City | Santa Cruz |
Period | 06/11/2023 → 09/11/2023 |
Sponsor | IEEE, IEEE Computer Society, IEEE Computer Society Technical Committee on Mathematical Foundations of Computing, NSF |
Series | Proceedings - Annual IEEE Symposium on Foundations of Computer Science, FOCS |
---|---|
ISSN | 0272-5428 |
Bibliographical note
Publisher Copyright:© 2023 IEEE.
Keywords
- Fast Update time
- Pseudorandom Generators
- Streaming Algorithms