Typological approaches to recognizing genus and subgenus of coronaviruses by structural and non-structural genes
Matematičeskaâ biologiâ i bioinformatika, Volume 19 (2024) no. 2, pp. 593-606.

See the article record from the source Math-Net.Ru

Owing to the rapid growth of viral genome data resulting from metagenomic research, bioinformatics and virology increasingly interact. There is even a term, viral informatics, which implies a whole complex of databases and knowledge bases about viruses, together with software tools for working with them. Annotation of viral genomes has previously been identified as one of the problems of bioinformatics in virology. Using recognition of coronavirus genus and subgenus as an example, the present work proposes a fairly simple and effective typological approach to virus annotation based on the codon frequency characteristics of individual genes. A typological approach averages known data, here the codon frequency characteristics, and determines how closely the corresponding characteristics of the object under consideration resemble them. Recognition of genus and subgenus is based on a statistic that measures the deviation of the coronavirus gene under consideration from the corresponding gene of a viral genome with known genus or subgenus. The work compares recognition based on structural genes encoding virion proteins (nucleocapsid protein N and spike protein S) with recognition based on the genes of non-structural proteins combined into a single reading frame, ORF1ab. Four typological approaches are discussed. In the first two, averaging over the genera was performed using all available data and using data on prototype strains only, respectively. In the third approach, the original data on prototype strains were averaged over the subgenera. The fourth approach was based on the individual frequency characteristics of the prototype strains of the subgenera. Three of the four typological approaches were highly efficient in recognizing coronavirus genus and subgenus when the N gene was used. The fourth approach proved the most effective for identifying genus and subgenus. 
In addition, it made it possible to reduce the number of codons considered in the N gene and to raise recognition efficiency to almost 100%.
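The general idea of the abstract (average codon frequencies per known group, then assign a query gene to the group it deviates from least) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the statistic, so the sum of squared differences used here is an assumption, and the function names (codon_frequencies, average_profile, deviation, recognize) are hypothetical.

```python
from collections import Counter
from itertools import product

# All 64 codons over the DNA alphabet, in a fixed order.
CODONS = ["".join(c) for c in product("ACGT", repeat=3)]
CODON_SET = set(CODONS)


def codon_frequencies(gene):
    """Relative frequency of each of the 64 codons in an in-frame coding sequence."""
    gene = gene.upper().replace("U", "T")
    # Split into consecutive triplets, ignoring an incomplete trailing codon.
    triplets = [gene[i:i + 3] for i in range(0, len(gene) - 2, 3)]
    # Triplets containing ambiguous bases (e.g. N) are skipped.
    counts = Counter(t for t in triplets if t in CODON_SET)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no complete unambiguous codon")
    return [counts[c] / total for c in CODONS]


def average_profile(genes):
    """Typological profile: codon frequencies averaged over genes of one known group."""
    profiles = [codon_frequencies(g) for g in genes]
    return [sum(column) / len(profiles) for column in zip(*profiles)]


def deviation(query, profile):
    """Deviation of a query from a profile.

    ASSUMPTION: sum of squared differences; the abstract does not name the statistic.
    """
    return sum((q - p) ** 2 for q, p in zip(query, profile))


def recognize(gene, profiles):
    """Return the label (genus or subgenus) whose profile deviates least from the gene."""
    query = codon_frequencies(gene)
    return min(profiles, key=lambda label: deviation(query, profiles[label]))


# Toy sequences invented for illustration only; they are not coronavirus genes.
toy_profiles = {
    "group_A": average_profile(["ATGAAAAAA", "ATGAAAAAG"]),
    "group_B": average_profile(["ATGGGGGGG", "ATGGGGGGC"]),
}
print(recognize("ATGAAAAAAAAA", toy_profiles))
```

Passing a single prototype sequence per label to average_profile corresponds loosely to working with individual prototype-strain characteristics instead of group averages, as in the fourth approach; again, this mapping is only a reading of the abstract.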
@article{MBB_2024_19_2_a8,
     author = {M. B. Chaley and V. A. Kutyrkin},
     title = {Typological approaches to recognizing genus and subgenus of coronaviruses by structural and non-structural genes},
     journal = {Matemati\v{c}eska\^a biologi\^a i bioinformatika},
     pages = {593--606},
     publisher = {mathdoc},
     volume = {19},
     number = {2},
     year = {2024},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MBB_2024_19_2_a8/}
}
TY  - JOUR
AU  - M. B. Chaley
AU  - V. A. Kutyrkin
TI  - Typological approaches to recognizing genus and subgenus of coronaviruses by structural and non-structural genes
JO  - Matematičeskaâ biologiâ i bioinformatika
PY  - 2024
SP  - 593
EP  - 606
VL  - 19
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MBB_2024_19_2_a8/
LA  - ru
ID  - MBB_2024_19_2_a8
ER  - 
%0 Journal Article
%A M. B. Chaley
%A V. A. Kutyrkin
%T Typological approaches to recognizing genus and subgenus of coronaviruses by structural and non-structural genes
%J Matematičeskaâ biologiâ i bioinformatika
%D 2024
%P 593-606
%V 19
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MBB_2024_19_2_a8/
%G ru
%F MBB_2024_19_2_a8
M. B. Chaley; V. A. Kutyrkin. Typological approaches to recognizing genus and subgenus of coronaviruses by structural and non-structural genes. Matematičeskaâ biologiâ i bioinformatika, Volume 19 (2024) no. 2, pp. 593-606. http://geodesic.mathdoc.fr/item/MBB_2024_19_2_a8/