Spectral-Statistical Approach to Latent Profile Periodicity Recognition in DNA Sequences
Matematičeskaâ biologiâ i bioinformatika, Tome 9 (2014) no. 1, pp. 33-62.

Voir la notice de l'article provenant de la source Math-Net.Ru

Earlier, a new notion of latent profile periodicity was introduced by the authors of present work. In 2013 in the “Mathematical biology and bioinformatics” journal an article was published, where a method for recognizing latent periodicity of such a new type was described. This method is based on employing spectral-statistical approach (2S-approach) to DNA sequences analysis. Efficiency of the elaborated method was shown, and a comparison with other known methods was done, particularly with information decomposition method (ID-method). In reply to the publication, the ID-method developers published a commentary in the journal. The commentary criticized the latent profile periodicity recognition method based on the 2S-approach. As follows from the commentary, its authors either did not read our article attentively or they are not aware of some notions of mathematical apparatus used in the article. So, we believe it is necessary to clarify once more the theoretical fundamentals of our method and to draw attention to key moments of its application, which have not been appreciated by our opponents. Moreover in the present work it is shown that the ID-method, contrary to the 2S-approach, is not a sustainable technique for revealing latent periodicity in DNA sequences, what can lead to incorrectness of its results.
@article{MBB_2014_9_1_a15,
     author = {V. A. Kutyrkin and M. B. Chaley},
     title = {Spectral-Statistical {Approach} to {Latent} {Profile} {Periodicity} {Recognition} in {DNA} {Sequences}},
     journal = {Matemati\v{c}eska\^a biologi\^a i bioinformatika},
     pages = {33--62},
     publisher = {mathdoc},
     volume = {9},
     number = {1},
     year = {2014},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MBB_2014_9_1_a15/}
}
TY  - JOUR
AU  - V. A. Kutyrkin
AU  - M. B. Chaley
TI  - Spectral-Statistical Approach to Latent Profile Periodicity Recognition in DNA Sequences
JO  - Matematičeskaâ biologiâ i bioinformatika
PY  - 2014
SP  - 33
EP  - 62
VL  - 9
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MBB_2014_9_1_a15/
LA  - ru
ID  - MBB_2014_9_1_a15
ER  - 
%0 Journal Article
%A V. A. Kutyrkin
%A M. B. Chaley
%T Spectral-Statistical Approach to Latent Profile Periodicity Recognition in DNA Sequences
%J Matematičeskaâ biologiâ i bioinformatika
%D 2014
%P 33-62
%V 9
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MBB_2014_9_1_a15/
%G ru
%F MBB_2014_9_1_a15
V. A. Kutyrkin; M. B. Chaley. Spectral-Statistical Approach to Latent Profile Periodicity Recognition in DNA Sequences. Matematičeskaâ biologiâ i bioinformatika, Tome 9 (2014) no. 1, pp. 33-62. http://geodesic.mathdoc.fr/item/MBB_2014_9_1_a15/

[1] Chalei M. B., Kutyrkin V. A., “Raspoznavanie skrytoi periodichnosti v posledovatelnostyakh DNK”, Matematicheskaya biologiya i bioinformatika, 8:2 (2013), 502–512 (data obrascheniya 30.12.2013) http://www.matbio.org/2013/Chaley_8_502.pdf

[2] Shelenkov A., Skryabin K., Korotkov E., “Search and classification of potential minisatellite sequences from bacterial genomes”, DNA Res., 13 (2006), 89–102 | DOI

[3] Korotkov E. V., Korotkova M. A., Kudryashov N. A., “Information decomposition method for analysis of symbolical sequences”, Physical Letters A, 312 (2003), 198–210 | DOI | MR | Zbl

[4] Korotkov E. V., Shelenkov A. A., Korotkova M. A., “K voprosu o raspoznavanii skrytoi periodichnosti v posledovatelnostyakh DNK”, Matematicheskaya biologiya i bioinformatika, 8:2 (2013), 529–536 (data obrascheniya 30.12.2013) http://www.matbio.org/2013/Korotkov_8_529.pdf

[5] Chaley M., Kutyrkin V., “Profile-Statistical Periodicity of DNA Coding Regions”, DNA Res., 18 (2011), 353–362 | DOI

[6] Chaley M. B., Kutyrkin V. A., “Structure of proteins and latent periodicity in their genes”, Moscow Univ. Biol. Sci. Bull., 65 (2010), 133–135 | DOI

[7] Kutyrkin V. A., Chalei M. B., “Raspoznavanie razlichnykh urovnei v organizatsii kodirovaniya geneticheskoi informatsii”, Vestnik MGTU im. N. E. Baumana. Seriya Estestvennye nauki, 2011, Spets. vypusk No 2 «Matematicheskoe modelirovanie», 200–215

[8] Benson G., “Tandem repeats finder: a program to analyze DNA sequences”, Nucl. Acids Res., 27 (1999), 573–580 | DOI

[9] Chaley M., Kutyrkin V., “Model of perfect tandem repeat with random pattern and empirical homogeneity testing poly-criteria for latent periodicity revelation in biological sequences”, Math. Biosci., 211 (2008), 186–204 | DOI | MR | Zbl

[10] Gelfand Y., Rodriguez A., Benson G., “TRDB — the Tandem Repeats Database”, Nucleic Acids Res., 35 (2007), 80–87 (data obrascheniya 30.12.2013) http://tandem.bu.edu/cgi-bin/trdb/trdb.exe | DOI

[11] Chaley M. B., Nazipova N. N., Kutyrkin V. A., “Statistical methods for detecting latent periodicity patterns in biological sequences: the case of small-size samples”, Pattern Recogn. Image Anal., 19 (2009), 358–367 | DOI

[12] Chalei M. B., Nazipova N. N., Kutyrkin V. A., “Sovmestnoe ispolzovanie razlichnykh kriteriev proverki odnorodnosti dlya vyyavleniya skrytoi periodichnosti v biologicheskikh posledovatelnostyakh”, Mat. biologiya i bioinformatika, 2:1 (2007), 20–35 (data obrascheniya 30.12.2013) http://www.matbio.org/downloads/Chaley2007(2_20).pdf

[13] Chalei M. B., Kutyrkin V. A., Tyulbasheva G. E., Teplukhina E. I., Nazipova N. N., “Issledovanie fenomena skrytoi periodichnosti v genomakh eukarioticheskikh organizmov”, Matematicheskaya biologiya i bioinformatika, 8:2 (2013), 480–501 (data obrascheniya 30.12.2013) http://www.matbio.org/2013/Chaley_8_480.pdf

[14] Kramer G., Matematicheskie metody statistiki, Mir, M., 1975, 648 pp.

[15] Kanehisa M., Goto S., Sato Y., Furumichi M., Tanabe M., “KEGG for integration and interpretation of large-scale molecular datasets”, Nucleic Acids Res., 40 (2012), D109–D114 | DOI

[16] Kutyrkin V. A., Chalei M. B., “Strukturnye razlichiya kodiruyuschikh i nekodiruyuschikh raionov posledovatelnostei DNK genoma cheloveka”, Vestnik MGTU im. N. E. Baumana. Seriya Estestvennye nauki, 2012, Spets. vypusk No 3 «Matematicheskoe modelirovanie», 146–157

[17] Tiwari S., Ramachandran S., Bhattacharya A., Bhattacharya S., Ramaswamy R., “Prediction of probable genes by Fourier analysis of genomic sequences”, Computer Applications in Biosciences, 13:3 (1997), 263–270

[18] FTG: Fast Fourier Transform based GENE Prediction Server (data obrascheniya 30.12.2013) http://www.imtech.res.in/raghava/ftg/

[19] Kulbak S., Teoriya informatsii i statistika, Nauka, M., 1967, 408 pp.

[20] Korotkov E. V., Korotkova M. A., Tulko J. S., “Latent sequence periodicity of some oncogenes and DNA-binding protein genes”, CABIOS, 13 (1997), 37–44