A system for automatic construction of knowledge graphs of mathematical documents
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 165 (2023) no. 3, pp. 264-281

Voir la notice du chapitre de livre provenant de la source Math-Net.Ru

This article outlines the process of creating an automated system for knowledge graph construction from collections of mathematical documents in LATEX format. The MathCollectionOntology, which defines the types of objects and relationships in knowledge graphs, was developed. The introduced toolkit includes methods for extracting mathematical terms, browsing and identifying document topics, extracting entities from LATEX code, and calculating statistical parameters of the graph. The parsed entities are mathematical terms, topics generated through the Latent Dirichlet Allocation, UDC codes, used formulas, author affiliations, cited literature, and others. The knowledge graph captures each extracted object using specific types of relationships defined in the MathCollectionOntology. Here, a knowledge graph was coined for a collection of articles published in Izvestiya VUZov. Matematika journal (1114 Russian-language documents in LATEX format). The thematic terms of the document topics were described. The quantitative parameters of the constructed knowledge graph were obtained.
Keywords: knowledge graph construction, linked open data, topic modeling, mathematical article, text processing.
@article{UZKU_2023_165_3_a6,
     author = {O. A. Nevzorova and B. T. Gizatullin},
     title = {A system for automatic construction of knowledge graphs of mathematical documents},
     journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
     pages = {264--281},
     publisher = {mathdoc},
     volume = {165},
     number = {3},
     year = {2023},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/UZKU_2023_165_3_a6/}
}
TY  - JOUR
AU  - O. A. Nevzorova
AU  - B. T. Gizatullin
TI  - A system for automatic construction of knowledge graphs of mathematical documents
JO  - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
PY  - 2023
SP  - 264
EP  - 281
VL  - 165
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/UZKU_2023_165_3_a6/
LA  - ru
ID  - UZKU_2023_165_3_a6
ER  - 
%0 Journal Article
%A O. A. Nevzorova
%A B. T. Gizatullin
%T A system for automatic construction of knowledge graphs of mathematical documents
%J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
%D 2023
%P 264-281
%V 165
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/UZKU_2023_165_3_a6/
%G ru
%F UZKU_2023_165_3_a6
O. A. Nevzorova; B. T. Gizatullin. A system for automatic construction of knowledge graphs of mathematical documents. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 165 (2023) no. 3, pp. 264-281. http://geodesic.mathdoc.fr/item/UZKU_2023_165_3_a6/