Matrix text models. Text corpora models
Matematičeskoe modelirovanie, Tome 32 (2020) no. 2, pp. 37-57

Voir la notice de l'article provenant de la source Math-Net.Ru

The models of text corpora, formed on the basis of the matrix model of texts in natural languages, are presented. As methods to form models of collections we consider the techniques of computational identification of the thematic structure of the collections. We suggest to use the models for searching for thematically similar text collections and thematic categorization of texts based on text models and text collections. The differences of the proposed models of text collections from the common approaches to their analysis and modeling are analyzed.
Keywords: natural language texts, text corpora, text corpora models, topic models, text models, text information retrieval.
@article{MM_2020_32_2_a2,
     author = {M. G. Kreines and E. M. Kreines},
     title = {Matrix text models. {Text} corpora models},
     journal = {Matemati\v{c}eskoe modelirovanie},
     pages = {37--57},
     publisher = {mathdoc},
     volume = {32},
     number = {2},
     year = {2020},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MM_2020_32_2_a2/}
}
TY  - JOUR
AU  - M. G. Kreines
AU  - E. M. Kreines
TI  - Matrix text models. Text corpora models
JO  - Matematičeskoe modelirovanie
PY  - 2020
SP  - 37
EP  - 57
VL  - 32
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MM_2020_32_2_a2/
LA  - ru
ID  - MM_2020_32_2_a2
ER  - 
%0 Journal Article
%A M. G. Kreines
%A E. M. Kreines
%T Matrix text models. Text corpora models
%J Matematičeskoe modelirovanie
%D 2020
%P 37-57
%V 32
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MM_2020_32_2_a2/
%G ru
%F MM_2020_32_2_a2
M. G. Kreines; E. M. Kreines. Matrix text models. Text corpora models. Matematičeskoe modelirovanie, Tome 32 (2020) no. 2, pp. 37-57. http://geodesic.mathdoc.fr/item/MM_2020_32_2_a2/