Several explicit stochastic processes are known to satisfy Hilberg's law, a power-law growth of block entropy conjectured for natural language and recently connected to the neural scaling law. Existing examples either possess a positive Shannon entropy rate, are non-ergodic, or require comparatively involved constructions. We introduce multiperiodic processes, a new class of stationary ergodic processes over the natural numbers generated by random shifts of deterministic multiperiodic sequences. Under mild conditions, multiperiodic processes have vanishing Shannon entropy rate and, under a suitable parameterization, they satisfy both Zipf's law for symbol frequencies and Hilberg's law for block entropy. Since multiperiodic processes are not mixing, we identify the open problem of constructing an elementary strongly mixing source with vanishing entropy rate and Hilberg's law.
Multiperiodic Processes: Ergodic Sources with a Sublinear Entropy
Several explicit stochastic processes are known to satisfy Hilberg's law, a power-law growth of block entropy conjectured for natural language and recently connected to the neural scaling law.
- Year
- 2023
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2302.09049ARXIV-DEFAULT
- TL;DR
- Semantic Scholar