Frequency distributions in bioinformatics: the development
Proceedings of the Yerevan State University. Physical and mathematical sciences, no. 3 (2010), pp. 3-22.

Voir la notice de l'article provenant de la source Math-Net.Ru

The mathematical investigation of large-scale biomolecular sequences is being carried out by analyzing the properties of events arising in such sequences. This survey is devoted to discussion of results in this field. Based on general empirical facts being fulfilled for all frequency distributions, we discuss the axiomatics suggested by J. Astola and E. Danielyan. The axiomatics postulates the regular variation of frequency distribution with asymptotically constant slowly varying component, the form of its shape, and the stability by parameters. The verification of the axiomatics fulfillment for well-known frequency distributions is done. The paper describes also methods of construction of new parametric families of frequency distributions. These methods are: usage of stationary distributions of birth-death process, special functions, stable densities, etc. The problem of stability by parameters is formulated the results on stability by parameters in terms of various classical metrics are given. The conditions of regular variation for different families of frequency distributions are formulated.
Keywords: frequency distributbiomion, olecular sequence, regular variation, convexity, stability by parameters, asymptotic expansion.
@article{UZERU_2010_3_a0,
     author = {J. T. Astola and E. A. Danielyan and S. K. Arzumanyan},
     title = {Frequency distributions in bioinformatics: the development},
     journal = {Proceedings of the Yerevan State University. Physical and mathematical sciences},
     pages = {3--22},
     publisher = {mathdoc},
     number = {3},
     year = {2010},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/UZERU_2010_3_a0/}
}
TY  - JOUR
AU  - J. T. Astola
AU  - E. A. Danielyan
AU  - S. K. Arzumanyan
TI  - Frequency distributions in bioinformatics: the development
JO  - Proceedings of the Yerevan State University. Physical and mathematical sciences
PY  - 2010
SP  - 3
EP  - 22
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/UZERU_2010_3_a0/
LA  - en
ID  - UZERU_2010_3_a0
ER  - 
%0 Journal Article
%A J. T. Astola
%A E. A. Danielyan
%A S. K. Arzumanyan
%T Frequency distributions in bioinformatics: the development
%J Proceedings of the Yerevan State University. Physical and mathematical sciences
%D 2010
%P 3-22
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/UZERU_2010_3_a0/
%G en
%F UZERU_2010_3_a0
J. T. Astola; E. A. Danielyan; S. K. Arzumanyan. Frequency distributions in bioinformatics: the development. Proceedings of the Yerevan State University. Physical and mathematical sciences, no. 3 (2010), pp. 3-22. http://geodesic.mathdoc.fr/item/UZERU_2010_3_a0/

[1] G. Apic, J. Gough, S.A. Teichmann, “Domain combinations in archaeal, eubacterial and eukaryotic proteomes”, J. Mol. Biol., 301:2 (2001), 311–325 | DOI

[2] H. Jeong, B. Tombor, R. Albert, Z.N. Ottval, “The large-scale organization of metabolic networks”, Nature, 407:6804 (2000), 651–654 | DOI

[3] A. Rzhetsky, S.M. Gomes, “Birth of scale-free molecular networks and the number of distinct DNA and protein domains per genome”, Bioinformatics, 17:10 (2001), 988–996 | DOI

[4] A. Wagner, D.A. Fell, “The small world inside large metabolic networks”, Proc. Roy. Soc. London B, 268:1478 (2001), 1803–1810 | DOI

[5] Y.I. Wolf, G. Kalev, E.V. Koonin, Scale-free networks in biology: new insights into the fundamentals of evolution?, BioEssays, 24:2 (2002), 105–109 | DOI

[6] S. Wuchty, “Scale-free behavior in protein domain networks”, Mol. Biol. Evol., 18:9 (2001), 1694–1702 | DOI

[7] J.J. Ramsden, J. Vohradsky, “Zipf-like behavior in procaryotic protein expression”, Phys.Rev., E, 56:6 (1998), 7777–7780 | DOI

[8] V. A. Kuznetsov, “Distribution associated with stochastic processes of gene expression in a single eukaryotic cell”, EURASIP J. Apll. Signal Process, 2001, no. 4, 285–296 | DOI | Zbl

[9] V. A. Kuznetsov, “Statistics of the Number of Transcripts and Protein Sequences encoded in Genome”, Computational and Statistical Methods to Genomics, eds. W. Zhang, I. Shmulevich, Kluwer, Boston, 2002, 125–171

[10] S.A. Kauffman, The Origins of Order: Self-Organization and Selection in Evolution, Oxford Univ. Press, NY, 1993

[11] T. Saaty, Elements of Queuing Theory, 1983, Dower Publications | MR

[12] J. Astola, E. Danielian, “Regularly varying skewed distributions generated by birthdeath process”, TICSP Series, 27, Tampere, 2004, 1–94

[13] J.U. Yule, “A mathematical theory of evolution, based on the conclusions of Dr. J. C. Willis, F. R. S. Philos”, Trans. Roy. Soc. London B, 213 (1924), 21–87 | DOI

[14] H.A. Simon, “On a Class of Skew Distribution Functions”, Biometrica, 42 (1955), 425–440 | DOI | MR | Zbl

[15] W. Glanzel, A. Schubert, “Predictive aspects of a stochastic model for citation processes”, Inform Process. Manager, 31:1 (1995), 69–80 | DOI

[16] S. Bornholdt, H. Ebel, “World Wide Web Scaling Exponent from Simon's 1955 Model”, Phys. Rev. E, 64 (2001), 035104(R) | DOI

[17] V. J. Oluič-Vukovič, “Simon's generating mechanism: Consequences and their correspondence to empirical facts”, Am. Soc. Inform. Sci., 49:10 (1998), 867–880 | 3.0.CO;2-9 class='badge bg-secondary rounded-pill ref-badge extid-badge'>DOI

[18] V. A. Kuznetsov, “Family of skewed distributions associated with the gene expression and proteome evolution”, Signal Process, 83:4 (2003), 889–910 | DOI | Zbl

[19] E. Danielian, J. Astola, Facta Universitatis (NiŠ), 17 (2004), 405–419 | DOI

[20] E.A. Danielian, G.P. Avagyan, Matem. v Visshey Shcole, 2008, no. 4, 17–23 (in Russian)

[21] E. Danielian, J. Astola, TICSP Series, 34, Tampere, 127–132

[22] J. Astola, E. Danielian, TICSP Series, 31, Tampere, 2006

[23] A.G. Arakelian, The Stability of Frequency Distributions in Biomolecular Models, The abstract of Ph.D. Thesis, YSU press, Yerevan, 2007 (in Russian)

[24] S.P. Yakovlev, The Analysis of Analytic Properties Multi-dimensional Frequency Distributions, The abstract of Ph.D. Thesis, SEUA, Yerevan, 2008 (in Russian)

[25] G.P. Avagyan, The Analysis of Properties of Regularly Varying Distributions, The abstract of Ph.D. Thesis, SEUA, Yerevan, 2009 (in Russian)

[26] E. Seneta, Regularly Varying Functions, Lecture Notes in Mathematics, Springer–Verlag, 1976 | DOI | MR | Zbl

[27] N.H. Bingham, C.M. Goldie, J.L. Tiegels, Regular Variation, Cambridge Univ. Press, 1986

[28] E. Danielian, TICSP Series, 12, Tampere, 2001, 1–80

[29] G.S. Arutyunian, S.P. Yakovlev, Matem. v Visshey Shcole, 2008, no. 4, 60–63 (in Russian)

[30] A.H. Arakelyan, K.A. Mehrdy, Matem. v Visshey Shcole, 2007, no. 3, 5–13 (in Russian)

[31] E.A. Danielian, G.P. Avagyan, Doklady NAS RA, 2009, no. 109 (1), 21–31 | MR

[32] J. Astola, E. Danielian, “On Regularly Varying Skewed Distributions generated by Birth-Death Process”, Facta Universitatis (NiŠ), 19 (2006), 109–131 | DOI

[33] B.S. Thomson, J.B. Bruckner, A.M. Bruckner, Elementary Real Analysis, Pentice–Hall, Upper Saddle River–New Jersey, 2001, 677 pp.

[34] S.K. Arzumanyan, Vestnik SEUA, 2009, no. 2, 34–40 (in Russian)

[35] G.P. Avagyan, Inform. Techn. And Control, 2009, no. 1, 8–18 (in Russian) | MR

[36] W. Feller, An Introduction to Probability Theory and its Applications, v. 1, eds. 1st, John Wiley and Sons, 1957 | MR | Zbl

[37] I.S. Gransteyn, I.M. Ryznik, Table of Integrals, Series and Products, Academic Press, New York and London, 1965 | MR

[38] E Danielian., J. Astola, “On Generating Functions of Pareto and Warring Distributions”, Spectral Methods and Multirate Signal Proc., 37, eds. Bregovic R., Gotchev A, M., 2007, 235–237

[39] A.H. Arakelyan, Matem. v Visshey Shcole, 2006, no. 2, 5–10 (in Russian)

[40] A.H. Arakelyan, Matem. v Visshey Shcole, 2006, no. 2, 70–75 (in Russian)

[41] J. Astola, E.A. Danielian, A.H. Arakelyan, Doklady NAS RA, 108 (2008), 99–109 (in Russian) | MR

[42] S.P. Yakovlev, Proc. of Engineer. Ac. of Armenia, 2007, no. 4, 26–33

[43] S.P. Yakovlev, Modelirovanie Optimizatsia Upravlenie, 2008, no. 11, 139–144, Yerevan (in Russian)

[44] J. Astola, E.A. Danielian, S. Yakovlev, Proc. of Engineer. Ac. of Armenia, 2007, no. 2, 265– 272

[45] W. Feller, Introduction to Probability Theory and its Applications, v. 2, ed. 1st ed., John Wiley and Sons, 1966 | MR | Zbl

[46] V.M. Zolotarev, Transl. of Math. Monographs., 65, Amer. Math. Soc., 1980

[47] J. Astola, E.A. Danielian, A.H. Arakelyan, Doklady NAS RA, 107:1 (2007), 26–36 | MR

[48] D. Farbod, “The Asymptotic Properties of Some Discrete Distributions Generated by Levy’S Law”, Far East Journal of Theoretical Statistic. India, 26:1 (2008), 121–128 | MR | Zbl

[49] D. Farbod, K.V. Gasparian, Matem. v Visshey Shcole, 5(1) (2009), 50–54 (in Russian)

[50] D. Farbod, K.V. Gasparian, Statistica, 68:3–4 (Asymptotic properties of maximum likelihood estimator for some discrete distributions generated by C), 321–326

[51] B.M. Zolotarev, TVP, 1995, no. 39, 354–362 (in Russian) | DOI

[52] J. Astola, E. Danielian, Facta Universitatis (NiS), 20:2 (2007), 119–146

[53] S.K. Arzumanyan, Vestnik-77 SEUA (Politechnik), 1, 2010 (to appear)