Linguistic and statistical analysis of the terminology for constructing the thesaurus of a specified field
Modelirovanie i analiz informacionnyh sistem, Vol. 22 (2015) no. 6, pp. 834-851.

See the article record from the source Math-Net.Ru

The paper analyzes a body of terms and terminological sources as groundwork for automating the construction of a thesaurus for a subject area; the subject area considered in this work is poetics. Preliminary systematization of the terminology, using a combined linguistic and statistical approach, yields a body of semantically related concepts. This body supports automated extraction of the semantic relationships between terms that define the structure of the thesaurus for the specified field.
Keywords: thesaurus, semantic similarity metrics, data mining, computational linguistics.
@article{MAIS_2015_22_6_a7,
     author = {M. S. Karyaeva},
     title = {Linguistic and statistical analysis of the terminology for~constructing the thesaurus of a specified field},
     journal = {Modelirovanie i analiz informacionnyh sistem},
     pages = {834--851},
     publisher = {mathdoc},
     volume = {22},
     number = {6},
     year = {2015},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/MAIS_2015_22_6_a7/}
}
TY  - JOUR
AU  - M. S. Karyaeva
TI  - Linguistic and statistical analysis of the terminology for constructing the thesaurus of a specified field
JO  - Modelirovanie i analiz informacionnyh sistem
PY  - 2015
SP  - 834
EP  - 851
VL  - 22
IS  - 6
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/MAIS_2015_22_6_a7/
LA  - ru
ID  - MAIS_2015_22_6_a7
ER  - 
%0 Journal Article
%A M. S. Karyaeva
%T Linguistic and statistical analysis of the terminology for constructing the thesaurus of a specified field
%J Modelirovanie i analiz informacionnyh sistem
%D 2015
%P 834-851
%V 22
%N 6
%I mathdoc
%U http://geodesic.mathdoc.fr/item/MAIS_2015_22_6_a7/
%G ru
%F MAIS_2015_22_6_a7
M. S. Karyaeva. Linguistic and statistical analysis of the terminology for constructing the thesaurus of a specified field. Modelirovanie i analiz informacionnyh sistem, Vol. 22 (2015) no. 6, pp. 834-851. http://geodesic.mathdoc.fr/item/MAIS_2015_22_6_a7/

[1] Lingvisticheskaya ontologiya «Tezaurus Rutez», http://www.labinform.ru/pub/ruthes/

[2] Tezaurus WordNet, http://wordnet.princeton.edu/

[3] Boikov V. N. et al., “Thesaurus as a poetological tool”, Modeling and Analysis of Information Systems, 17:1 (2010), 5–24 (in Russian)

[4] Boikov V. N., “Semanticheskaya model «Tezaurusa po poehtologii» v sostave informacionno-analiticheskoj sistemy”, Materialy nauchnoj konferencii «Internet i sovremennoe obshchestvo», 2013, 273–279 (in Russian)

[5] Boikov V. N., “Predmetno-orientirovannyj tezaurus v otkrytoj informacionno-analiticheskoj sisteme”, Ehlektronnye biblioteki: perspektivy, metody i tekhnologii, ehlektronnye kollekcii, RCDL'2013, 2013, 70–76 (in Russian)

[6] Tezaurus po poetologii, http://wikipoetics.ru/

[7] Hearst M. A., “Automated discovery of WordNet relations”, WordNet: an electronic lexical database, 1998, 131–153

[8] Panchenko A. I., “Izvlechenie semanticheskih otnoshenij iz statej Vikipedii s pomoshchyu algoritmov blizhajshih sosedej”, Otkrytye sistemy, 16 (2012), 18–27 (in Russian)

[9] Serelex: Poisk semanticheski svyaznykh slov, http://serelex.org/ru

[10] Kiselev Yu. A., “Metod izvlecheniya rodovidovyh otnoshenij mezhdu sushchestvitelnymi iz opredelenij tolkovyh slovarej”, Programmnaya inzheneriya, 10 (2015), 38–48 (in Russian)

[11] Kratkaya literaturnaya ehnciklopediya, Sov. Ehncikl., 1962–1978 (in Russian)

[12] Literaturnaya ehnciklopediya, Kom. akad., 1929–1939 (in Russian)

[13] Slovar literaturnyh terminov, Izd-vo L. D. Frenkel, 1925 (in Russian)

[14] Kvyatkovskij A. P., Poehticheskij slovar, Sov. Ehncikl., 1966 (in Russian)

[15] Bolshaya sovetskaya ehnciklopediya, Sov. ehncikl., 1969–1978 (in Russian)

[16] Zaliznyak A. A., Grammaticheskij slovar russkogo yazyka, Slovoizmenenie, 1980 (in Russian)

[17] Mystem, https://tech.yandex.ru/mystem/

[18] Segalovich I., “A Fast Morphological Algorithm with Unknown Word Guessing Induced by a Dictionary for a Web Search Engine”, MLMTA, 2003, 273–280

[19] PyMystem, https://pypi.python.org/pypi/pymystem3/0.1.1

[20] PyMorphy2, https://pymorphy2.readthedocs.org/en/latest/

[21] OpenCorpora, http://opencorpora.org/

[22] Django framework, https://www.djangoproject.com/

[23] Lyashevskaya O. N., Chastotnyj slovar sovremennogo russkogo yazyka (na materialah Nacionalnogo korpusa russkogo yazyka), Azbukovnik, 2009 (in Russian)