Application of the Monte Carlo method for searching for the possible reading frameshifts in genes
Matematičeskaâ biologiâ i bioinformatika, Tome 6 (2011) no. 1, pp. 79-91.

Voir la notice de l'article provenant de la source Math-Net.Ru

In the article we presented the method for searching for the possible reading frameshifts in genes based on revealing change points of triplet frequencies distribution. The statistical significance was estimated by Monte Carlo method. Correctness of the introduced method was demonstrated by using it to analysis the DNA sequences with artificial indels. The method developed was applied for searching for the change points in DNA sequences from databank KEGG GENES. It was revealed more than 140 thousands genes with change points at the significance level equal to 6 %. We classified sequences containing change points by field description in databank KEGG GENES. It appeared that many of them are pseudogenes or they were annotated earlier as sequences containing frameshifts. In addition to these sequences the change points were detected in many genes coding of PE-PGRS, cation channel family protein, PPE family protein and others. The relationship between change points and reading frameshifts in genes is discussed.
@article{MBB_2011_6_1_a0,
     author = {V. M. Rudenko and E. V. Korotkov},
     title = {Application of the {Monte} {Carlo} method for searching for the possible reading frameshifts in genes},
     journal = {Matemati\v{c}eska\^a biologi\^a i bioinformatika},
     pages = {79--91},
     publisher = {mathdoc},
     volume = {6},
     number = {1},
     year = {2011},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MBB_2011_6_1_a0/}
}
TY  - JOUR
AU  - V. M. Rudenko
AU  - E. V. Korotkov
TI  - Application of the Monte Carlo method for searching for the possible reading frameshifts in genes
JO  - Matematičeskaâ biologiâ i bioinformatika
PY  - 2011
SP  - 79
EP  - 91
VL  - 6
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MBB_2011_6_1_a0/
LA  - ru
ID  - MBB_2011_6_1_a0
ER  - 
%0 Journal Article
%A V. M. Rudenko
%A E. V. Korotkov
%T Application of the Monte Carlo method for searching for the possible reading frameshifts in genes
%J Matematičeskaâ biologiâ i bioinformatika
%D 2011
%P 79-91
%V 6
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MBB_2011_6_1_a0/
%G ru
%F MBB_2011_6_1_a0
V. M. Rudenko; E. V. Korotkov. Application of the Monte Carlo method for searching for the possible reading frameshifts in genes. Matematičeskaâ biologiâ i bioinformatika, Tome 6 (2011) no. 1, pp. 79-91. http://geodesic.mathdoc.fr/item/MBB_2011_6_1_a0/

[1] Watson J. D., Levine M., Baker T. A., Gann A., Bell S. P., Molecular Biology of the Gene, Benjamin-Cummings Pub. Corp., 2007

[2] Salem I. H., Kamoun F. et al., Mutations in LAMA2 and CAPN3 genes associated with genetic and phenotypic heterogeneities within a single consanguineous family involving both congenital and progressive muscular dystrophies, Bioscience Reports, data obrascheniya: 09.03.2011 http://journals.academia.edu/BioscienceReports

[3] Stallmeyer B., Fenge H., Nowak-Gottl U., Schulze-Bahr E., “Mutational spectrum in the cardiac transcription factor gene NKX2.5 (CSX) associated with congenital heart disease”, Clin Genet., 78:6 (2010), 533–540 | DOI

[4] Posfai J., Roberts R. J., “Finding errors in DNA sequences”, Proc. Natl. Acad. Sci. USA, 89 (1992), 4698–4702 | DOI

[5] Claverie J.-M., “Detecting frame shifts by amino acid sequence comparison”, J. Mol. Biol., 234:4 (1993), 1140–1157 | DOI

[6] Okamura K., Feuk L., Marquis-Bonet T., Navarro A., Scherer S. W., “Frequent appearance of novel protein-coding sequences by frameshift translation”, Genomics, 88 (2006), 690–697 | DOI

[7] Raes J., Van de Peer Y., “Functional divergence of proteins through frameshift mutations”, Trends Genet., 21 (2005), 428–431 | DOI

[8] Schiex Th., Gouzy J., Moisan A., Oliveira Y., “FrameD: a flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences”, NAR, 31:13 (2003), 3738–3741 | DOI

[9] Kislyuk A., Lomsadze A., Lapidus A. L., Borodovsky M., “Frameshift detection in prokaryotic genomic sequences”, Int. J. Bioinformatics Research and Applications, 5:4 (2009), 458–477 | DOI

[10] Fichant G. A., Quentini Y., “A frameshift error detection algorithm for DNA sequencing projects”, NAR, 23:15 (1995), 2900–2908 | DOI

[11] Bennetzen J. L., Hall B. D., “Codon selection in yeast”, J. Biol. Chem., 257 (1982), 3026–3031

[12] Carlstein E., Muller H.-G., Siegmund D. (eds.), Change-point problems, Lecture notes monograph series, 23, Institute of mathematical statistics, 1994 | MR | Zbl

[13] Kanehisa M., Goto S., Kawashima S., Okuno Y., Hattori M., “The KEGG resources for deciphering the genome”, Nucleic Acids Res., 32 (2004), 277–280 | DOI

[14] Korotkov E. V., Rudenko V. M., “Sdvig fazy tripletnoi periodichnosti v nukleotidnykh posledovatelnostyakh genov”, Matematicheskaya biologiya i bioinformatika, 4:2 (2009), 66–80

[15] Trifonov E. N., “Elucidating sequence codes: three codes for evolution”, Ann. NY Acad. Sci., 870 (1999), 330–338 | DOI

[16] Eigen M., Winkler-Oswatitsch R., “Transfer-RNA: the early adaptor”, Naturwissenschaften, 68 (1981), 217–228 | DOI

[17] Zoltowski M., “Is DNA Code Periodicity Only Due to CUF-Codons Usage Frequency?”, Conf. Proc. IEEE Eng Med. Biol. Soc., v. 1, 2007, 1383–1386

[18] Antezana M. A., Kreitman M., “The nonrandom location of synonymous codons suggests that reading frame-independent forces have patterned codon preferences”, J. Mol. Evol., 49:1 (1999), 36–43 | DOI | MR

[19] Kullback S., Information Theory and Statistics, Wiley, New York, 1959 | MR | Zbl

[20] Filina M. V., Zubkov A. M., “Exact computation of Pearson statistics distribution and some experimental results”, Austr. J. Statist., 37:1 (2008), 129–135

[21] Sprinthall R. C., Basic Statistical Analysis, Seventh Edition, Pearson Education Group, Boston, 2003

[22] Carpena P., Bernaola-Galván P., Román-Roldán R., Oliver J., “A simple and species-independent coding measure”, Gene, 300:1–2 (2002), 97–104 | DOI

[23] Korotkov E. V., Korotkova M. A., “Study of the triplet periodicity phase shifts in genes”, Journal of Integrative Bioinformatics, 7 (2010), 131–141