Searching Pair Points of Triplet Periodicity Phase Shifts in the Genes of 17 Bacterial Genomes
Matematičeskaâ biologiâ i bioinformatika, Tome 7 (2012), pp. 461-475.

Voir la notice de l'article provenant de la source Math-Net.Ru

This paper presents developed mathematical method for searching pair phase shifts of triplet periodicity. Such phase shifts could be a potential frameshifts in genes resulting the insertions of quite long DNA fragments. We developed software and checked if there are pair phase shifts of triplet periodicity in genes of 17 bacteria genomes. The results shows that there is about 1 percent of bacteria genes having pair phase shift of triplet periodicity in these 17 genoms. This paper also describes developed method for visualization of pair phase shifts of triplet periodicity and gives an examples of such shifts. Research results were partially confirmed by the search for aminoacids sequences similarities that had been made by alternative reading frames. The connection between pair phase shifts of triplet periodicity and frameshift in genes is disscussed.
@article{MBB_2012_7_a4,
     author = {Valentina M. Pugacheva and Alexander E. Korotkov and Eugene V. Korotkov},
     title = {Searching {Pair} {Points} of {Triplet} {Periodicity} {Phase} {Shifts} in the {Genes} of 17 {Bacterial} {Genomes}},
     journal = {Matemati\v{c}eska\^a biologi\^a i bioinformatika},
     pages = {461--475},
     publisher = {mathdoc},
     volume = {7},
     year = {2012},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MBB_2012_7_a4/}
}
TY  - JOUR
AU  - Valentina M. Pugacheva
AU  - Alexander E. Korotkov
AU  - Eugene V. Korotkov
TI  - Searching Pair Points of Triplet Periodicity Phase Shifts in the Genes of 17 Bacterial Genomes
JO  - Matematičeskaâ biologiâ i bioinformatika
PY  - 2012
SP  - 461
EP  - 475
VL  - 7
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MBB_2012_7_a4/
LA  - ru
ID  - MBB_2012_7_a4
ER  - 
%0 Journal Article
%A Valentina M. Pugacheva
%A Alexander E. Korotkov
%A Eugene V. Korotkov
%T Searching Pair Points of Triplet Periodicity Phase Shifts in the Genes of 17 Bacterial Genomes
%J Matematičeskaâ biologiâ i bioinformatika
%D 2012
%P 461-475
%V 7
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MBB_2012_7_a4/
%G ru
%F MBB_2012_7_a4
Valentina M. Pugacheva; Alexander E. Korotkov; Eugene V. Korotkov. Searching Pair Points of Triplet Periodicity Phase Shifts in the Genes of 17 Bacterial Genomes. Matematičeskaâ biologiâ i bioinformatika, Tome 7 (2012), pp. 461-475. http://geodesic.mathdoc.fr/item/MBB_2012_7_a4/

[1] Wei Q., Li L., Chen D. J. (eds.), DNA Repair, Genetic Instability, and Cancer, World Scientific, 2007

[2] Watson J. D., Levine M., Baker T. A., Gann A., Bell S. P., Molecular Biology of the Gene, Benjamin-Cummings Pub. Corp., 2007

[3] Okamura K., Feuk L., Marques-Bonet T., Navarro A., Scherer S. W., “Frequent appearance of novel protein-coding sequences by frameshift translation”, Genomics, 88 (2006), 690–697 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/j.ygeno.2006.06.009'>10.1016/j.ygeno.2006.06.009</ext-link>

[4] Raes J., Van de Peer Y., “Functional divergence of proteins through frameshift mutations”, Trends Genet., 21 (2005), 428–431 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/j.tig.2005.05.013'>10.1016/j.tig.2005.05.013</ext-link>

[5] Kramer E. M., Su H.-J., Wu C.-C., Hu J.-M., “A simplified explanation for the frameshift mutation that created a novel C-terminal motif in the APETALA3 gene lineageBMC”, Evolutionary Biology, 6 (2006), 30 <ext-link ext-link-type='doi' href='https://doi.org/10.1186/1471-2148-6-30'>10.1186/1471-2148-6-30</ext-link>

[6] States D. J., Botstein D., “Molecular sequence accuracy and the analysis of protein coding regions”, Proc. Natl. Acad. Sci., USA, 88 (1991), 5518–5522 <ext-link ext-link-type='doi' href='https://doi.org/10.1073/pnas.88.13.5518'>10.1073/pnas.88.13.5518</ext-link>

[7] Pearson W. R., Wood T., Zhang Z., Miller W., “Comparison of DNA sequences with protein sequences”, Genomics, 46 (1997), 24–36 <ext-link ext-link-type='doi' href='https://doi.org/10.1006/geno.1997.4995'>10.1006/geno.1997.4995</ext-link>

[8] Birney E., Thompson J., Gibson T., “PairWise and SearchWise: finding the optimal alignment in a simultaneous comparison of a protein profile against all DNA translation frames”, Nucl. Acids Res., 24 (1996), 2730–2739 <ext-link ext-link-type='doi' href='https://doi.org/10.1093/nar/24.14.2730'>10.1093/nar/24.14.2730</ext-link>

[9] Guan X., Uberbacher E. C., “Alignments of DNA and protein sequences containing frameshift errors”, Comput. Appl. Biosci., 12 (1996), 31–40

[10] Antonov I., Borodovsky M., “Genetack: frameshift identification in protein-coding sequences by the Viterbi algorithm”, J. Bioinform Comput Biol., 8:3, Jun. (2010), 535–51 <ext-link ext-link-type='doi' href='https://doi.org/10.1142/S0219720010004847'>10.1142/S0219720010004847</ext-link><ext-link ext-link-type='mr-item-id' href='http://mathscinet.ams.org/mathscinet-getitem?mr=2575440'>2575440</ext-link>

[11] Kislyuk A., Lomsadze A., Lapidus A. L., Borodovsky M., “Frameshift detection in prokaryotic genomic sequences”, Int. J. Bioinform. Res. Appl., 5:4 (2009), 458–477 <ext-link ext-link-type='doi' href='https://doi.org/10.1504/IJBRA.2009.027519'>10.1504/IJBRA.2009.027519</ext-link>

[12] Fichant G. A., Quentin Y., “A frameshift error detection algorithm for DNA sequencing projects”, Nucleic Acids Res., 23 (1995), 2900–2908 <ext-link ext-link-type='doi' href='https://doi.org/10.1093/nar/23.15.2900'>10.1093/nar/23.15.2900</ext-link>

[13] Médigue C., Rose M., Viari A., Danchin A., “Detecting and analyzing DNA sequencing errors: toward a higher quality of the Bacillus subtilis genome sequence”, Genome Res., 9 (1999), 1116–1127 <ext-link ext-link-type='doi' href='https://doi.org/10.1101/gr.9.11.1116'>10.1101/gr.9.11.1116</ext-link>

[14] Schiex T., Gouzy J., Moisan A., Oliveira Y. D., “FrameD: a flexible program for quality check and gene prediction in prokaryotic genomes and noisy matured eukaryotic sequences”, Nucleic Acids Res., 31 (2003), 3738–3741 <ext-link ext-link-type='doi' href='https://doi.org/10.1093/nar/gkg610'>10.1093/nar/gkg610</ext-link>

[15] Frenkel F. E., Korotkov E. V., “Classification analysis of triplet periodicity in protein-coding regions of genes”, Gene, 421 (2008), 52–60 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/j.gene.2008.06.012'>10.1016/j.gene.2008.06.012</ext-link>

[16] Frenkel F. E., Korotkov E. V., “Using triplet periodicity of nucleotide sequences for finding potential reading frame shifts in genes”, DNA Res., 16 (2009), 105–114 <ext-link ext-link-type='doi' href='https://doi.org/10.1093/dnares/dsp002'>10.1093/dnares/dsp002</ext-link>

[17] Korotkov E. V., Korotkova M. A., “Study of the triplet periodicity phase shifts in genes”, Journal of Integrative Bioinformatics, 7 (2010), 131–141

[18] Fickett J. W., “Predictive methods using nucleotide sequences”, Methods Biochem. Anal., 39 (1998), 231–245 <ext-link ext-link-type='doi' href='https://doi.org/10.1002/9780470110607.ch10'>10.1002/9780470110607.ch10</ext-link>

[19] Staden R., “Statistical and structural analysis of nucleotide sequences”, Methods Mol. Biol., 25 (1994), 69–77

[20] Baxevanis A. D., “Predictive methods using DNA sequences”, Methods Biochem. Anal., 43 (2001), 233–252 <ext-link ext-link-type='doi' href='https://doi.org/10.1002/0471223921.ch10'>10.1002/0471223921.ch10</ext-link>

[21] Gutierrez G., Oliver J. L., Marin A., “On the origin of the periodicity of three in protein coding DNA sequences”, J. Theor. Biol., 167:4, Apr. 21 (1994), 413–414 <ext-link ext-link-type='doi' href='https://doi.org/10.1006/jtbi.1994.1080'>10.1006/jtbi.1994.1080</ext-link>

[22] Gao J., Qi Y., Cao Y., Tung W.-W., “Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences”, Journal of Biomedicine and Biotechnology, 2 (2005), 139–146 <ext-link ext-link-type='doi' href='https://doi.org/10.1155/JBB.2005.139'>10.1155/JBB.2005.139</ext-link>

[23] Yin C., Yau S. S., “Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence”, Journal of Theoretical Biology, 247 (2007), 687–694 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/j.jtbi.2007.03.038'>10.1016/j.jtbi.2007.03.038</ext-link><ext-link ext-link-type='mr-item-id' href='http://mathscinet.ams.org/mathscinet-getitem?mr=2479617'>2479617</ext-link>

[24] Eskesen S. T., Eskesen F. N., Kinghorn B., Ruvinsky A., “Periodicity of DNA in exons”, BMC Molecular Biology, 5 (2004), 12 <ext-link ext-link-type='doi' href='https://doi.org/10.1186/1471-2199-5-12'>10.1186/1471-2199-5-12</ext-link>

[25] Bibb M. J., Findlay P. R., Johnson M. W., “The relationship between base composition and codon usage in bacterial genes and its use for the simple and reliable identification of protein-coding sequences”, Gene, 30:1–3, Oct. (1984), 157–166 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/0378-1119(84)90116-1'>10.1016/0378-1119(84)90116-1</ext-link>

[26] Konopka A. K., “Sequences and codes: fundamentals of biomolecular cryptology”, Biocomputing: Informatics and genome projects, ed. Smith D., Academic Press, San Diego, CA, 1994, 119–174

[27] Trifonov E. N., “Elucidating sequence codes: three codes for evolution”, Ann. NY Acad. Sci., 870 (1999), 330–338 <ext-link ext-link-type='doi' href='https://doi.org/10.1111/j.1749-6632.1999.tb08894.x'>10.1111/j.1749-6632.1999.tb08894.x</ext-link>

[28] Eigen M., Winkler-Oswatitsch R., “Transfer-RNA: the early adaptor”, Naturwissenschaften, 68 (1981), 217–228 <ext-link ext-link-type='doi' href='https://doi.org/10.1007/BF01047323'>10.1007/BF01047323</ext-link>

[29] Zoltowski M., Is DNA Code Periodicity Only Due to CUF — Codons Usage Frequency?, Conf. Proc. IEEE Eng. Med. Biol. Soc., 1 (2007), 1383–1386

[30] Antezana M. A., Kreitman M., “The nonrandom location of synonymous codons suggests that reading frame-independent forces have patterned codon preferences”, J. Mol. Evol., 49:1 (1999), 36–43 <ext-link ext-link-type='doi' href='https://doi.org/10.1007/PL00006532'>10.1007/PL00006532</ext-link><ext-link ext-link-type='mr-item-id' href='http://mathscinet.ams.org/mathscinet-getitem?mr=2504535'>2504535</ext-link>

[31] Korotkov E. V., Korotkova M. A., Frenkel F. E., Kudryashov N. A., “The Informational Concept of Searching for Periodicity in Symbol Sequences”, Molecular Biology, 37 (2003), 372–386 <ext-link ext-link-type='doi' href='https://doi.org/10.1023/A:1024231109360'>10.1023/A:1024231109360</ext-link>

[32] Suvorova Y. M., Rudenko V. M., Korotkov E. V., “Detection change points of triplet periodicity of gene”, Gene, 491 (2012), 58–64 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/j.gene.2011.08.032'>10.1016/j.gene.2011.08.032</ext-link>

[33] Strauss B. S., “Frameshift mutation, microsatellites and mismatch repair”, Mutation Research, 437 (1999), 195–203 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/S1383-5742(99)00066-6'>10.1016/S1383-5742(99)00066-6</ext-link>

[34] Korotkova M. A., Kudryashov N. A., Korotkov E. V., “An approach for searching insertions in bacterial genes leading to the phase shift of triplet periodicity”, Genomics Proteomics Bioinformatics, 9 (2011), 158–170 <ext-link ext-link-type='doi' href='https://doi.org/10.1016/S1672-0229(11)60019-3'>10.1016/S1672-0229(11)60019-3</ext-link>