Learning Syntactic Tagging of Macedonian Language
Computer Science and Information Systems, Volume 15 (2018) no. 3.

See the article record from the source Computer Science and Information Systems website

This paper presents the creation of machine-learning-based systems for part-of-speech (PoS) tagging of the Macedonian language. Four well-known PoS taggers previously implemented for English and Slavic languages were trained: TnT, cyclic dependency network, guided learning framework for bidirectional sequence classification, and dynamic features induction. Orwell’s novel “1984” was manually tagged by the authors and split into a training set and a test set. After training, the models were compared with one another. The resulting PoS tagger reaches an accuracy of 97.5%, which makes it well suited to the future grammatical tagging of the National Corpus of the Macedonian Language, which is currently in its initial stage. The part-of-speech tagger that was created is published online and is free to use.
Keywords: Part-of-speech tagging, TnT tagger, Cyclic dependency network, Guided learning for bidirectional sequence classification, Dynamic features induction
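The evaluation protocol described in the abstract (train on one part of a manually tagged text, then measure token-level accuracy on the held-out part) can be illustrated with a minimal sketch. The code below is not any of the four taggers named above; it is a simple most-frequent-tag baseline, and the function names (`train_baseline`, `tag`, `accuracy`), the tag labels, and the tiny English sentences are all hypothetical placeholders chosen only for illustration.

```python
from collections import Counter, defaultdict


def train_baseline(tagged_sentences):
    """Learn the most frequent tag per word and a global fallback tag."""
    per_word = defaultdict(Counter)
    all_tags = Counter()
    for sentence in tagged_sentences:
        for word, tag_label in sentence:
            per_word[word.lower()][tag_label] += 1
            all_tags[tag_label] += 1
    lexicon = {w: c.most_common(1)[0][0] for w, c in per_word.items()}
    default_tag = all_tags.most_common(1)[0][0]
    return lexicon, default_tag


def tag(words, lexicon, default_tag):
    """Tag each word with its most frequent training tag, or the fallback."""
    return [lexicon.get(w.lower(), default_tag) for w in words]


def accuracy(tagged_sentences, lexicon, default_tag):
    """Token-level accuracy: correctly tagged tokens / all tokens."""
    correct = total = 0
    for sentence in tagged_sentences:
        words = [w for w, _ in sentence]
        gold = [t for _, t in sentence]
        predicted = tag(words, lexicon, default_tag)
        correct += sum(p == g for p, g in zip(predicted, gold))
        total += len(gold)
    return correct / total if total else 0.0


# Hypothetical toy data; the paper uses a manually tagged "1984" instead.
train_set = [
    [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")],
    [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")],
    [("dogs", "NOUN"), ("chase", "VERB"), ("cats", "NOUN")],
]
test_set = [
    [("the", "DET"), ("cat", "NOUN"), ("barks", "VERB")],
    [("a", "DET"), ("bird", "NOUN"), ("sleeps", "VERB")],
]

lexicon, default_tag = train_baseline(train_set)
score = accuracy(test_set, lexicon, default_tag)
print(f"token accuracy: {score:.3f}")
```

Unknown words fall back to the most frequent training tag, so the sketch mislabels the unseen determiner in the second test sentence. Handling such unseen words well is one of the things that separates trained statistical taggers from a lookup baseline of this kind.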
@article{CSIS_2018_15_3_a18,
     author = {Martin Bonchanoski and Katerina Zdravkova},
     title = {Learning {Syntactic} {Tagging} of {Macedonian} {Language}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {15},
     number = {3},
     year = {2018},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2018_15_3_a18/}
}
TY  - JOUR
AU  - Martin Bonchanoski
AU  - Katerina Zdravkova
TI  - Learning Syntactic Tagging of Macedonian Language
JO  - Computer Science and Information Systems
PY  - 2018
VL  - 15
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2018_15_3_a18/
ID  - CSIS_2018_15_3_a18
ER  - 
%0 Journal Article
%A Martin Bonchanoski
%A Katerina Zdravkova
%T Learning Syntactic Tagging of Macedonian Language
%J Computer Science and Information Systems
%D 2018
%V 15
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2018_15_3_a18/
%F CSIS_2018_15_3_a18
Martin Bonchanoski; Katerina Zdravkova. Learning Syntactic Tagging of Macedonian Language. Computer Science and Information Systems, Volume 15 (2018) no. 3. http://geodesic.mathdoc.fr/item/CSIS_2018_15_3_a18/