Extending the applicability of the Zipf's laws to the sequences of byte data
Vestnik Sankt-Peterburgskogo universiteta. Prikladnaâ matematika, informatika, processy upravleniâ, Tome 20 (2024) no. 3, pp. 391-403

Voir la notice de l'article provenant de la source Math-Net.Ru

Zipf's law have been shown to hold true in many places. From it's first idea of a statistical phenomenon related to natural language to it's later adaptations for economical, social and many other fields, it has been shown to work almost universally. In all of these cases authors discuss the applicability of the Zipf's law in terms of semantically complex structures. We take this notion a step further and show how this law can work for data analysis, in particular for the sequences of byte data, obtained from various sources. We show that, using the basic chunking methodology, the Zipf's law can be shown to hold true for many different types of raw sequences of byte data. In particular, the law holds true in all caes for the "middle point’’ of data, where it is present with a degree of certainty of more than 90 %. We conclude by discussing the implications and potential use cases of these findings.
Keywords: Zipf's laws, byte data, chunking, frequency analysis.
@article{VSPUI_2024_20_3_a6,
     author = {S. L. Sergeev and I. S. Blekanov and F. V. Ezhov and N. A. Tarasov},
     title = {Extending the applicability of the {Zipf's} laws to the sequences of byte data},
     journal = {Vestnik Sankt-Peterburgskogo universiteta. Prikladna\^a matematika, informatika, processy upravleni\^a},
     pages = {391--403},
     publisher = {mathdoc},
     volume = {20},
     number = {3},
     year = {2024},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/VSPUI_2024_20_3_a6/}
}
TY  - JOUR
AU  - S. L. Sergeev
AU  - I. S. Blekanov
AU  - F. V. Ezhov
AU  - N. A. Tarasov
TI  - Extending the applicability of the Zipf's laws to the sequences of byte data
JO  - Vestnik Sankt-Peterburgskogo universiteta. Prikladnaâ matematika, informatika, processy upravleniâ
PY  - 2024
SP  - 391
EP  - 403
VL  - 20
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/VSPUI_2024_20_3_a6/
LA  - en
ID  - VSPUI_2024_20_3_a6
ER  - 
%0 Journal Article
%A S. L. Sergeev
%A I. S. Blekanov
%A F. V. Ezhov
%A N. A. Tarasov
%T Extending the applicability of the Zipf's laws to the sequences of byte data
%J Vestnik Sankt-Peterburgskogo universiteta. Prikladnaâ matematika, informatika, processy upravleniâ
%D 2024
%P 391-403
%V 20
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/VSPUI_2024_20_3_a6/
%G en
%F VSPUI_2024_20_3_a6
S. L. Sergeev; I. S. Blekanov; F. V. Ezhov; N. A. Tarasov. Extending the applicability of the Zipf's laws to the sequences of byte data. Vestnik Sankt-Peterburgskogo universiteta. Prikladnaâ matematika, informatika, processy upravleniâ, Tome 20 (2024) no. 3, pp. 391-403. http://geodesic.mathdoc.fr/item/VSPUI_2024_20_3_a6/