Ontology-based multi-label classification of economic articles
Computer Science and Information Systems, Tome 8 (2011) no. 1.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

The paper presents an approach to the task of automatic document categorization in the field of economics. Since the documents can be annotated with multiple keywords (labels), we approach this task by applying and evaluating multi-label classification methods of supervised machine learning. We describe forming a test corpus of 1015 economic documents that we automatically classify using a tool which integrates ontology construction with text mining methods. In our experimental work, we evaluate three groups of multi-label classification approaches: transformation to single-class problems, specialized multi-label models, and hierarchical/ranking models. The classification accuracies of all tested classification models indicate that there is a potential for using all of the evaluated methods to solve this task. The results show the benefits of using complex groups of approaches which benefit from exploiting dependence between the labels. A good alternative to these approaches is also single-class naive Bayes classifiers coupled with the binary relevance transformation approach.
Keywords: ontology, multi-label classification, machine learning, text categorization, economics, document classification
@article{CSIS_2011_8_1_a5,
     author = {Sergeja Vogrin\v{c}i\v{c} and Zoran Bosni\'c},
     title = {Ontology-based multi-label classification of economic articles},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {8},
     number = {1},
     year = {2011},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2011_8_1_a5/}
}
TY  - JOUR
AU  - Sergeja Vogrinčič
AU  - Zoran Bosnić
TI  - Ontology-based multi-label classification of economic articles
JO  - Computer Science and Information Systems
PY  - 2011
VL  - 8
IS  - 1
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2011_8_1_a5/
ID  - CSIS_2011_8_1_a5
ER  - 
%0 Journal Article
%A Sergeja Vogrinčič
%A Zoran Bosnić
%T Ontology-based multi-label classification of economic articles
%J Computer Science and Information Systems
%D 2011
%V 8
%N 1
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2011_8_1_a5/
%F CSIS_2011_8_1_a5
Sergeja Vogrinčič; Zoran Bosnić. Ontology-based multi-label classification of economic articles. Computer Science and Information Systems, Tome 8 (2011) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2011_8_1_a5/