Text document classification based on mixture models
Kybernetika, Volume 40 (2004) no. 3, p. [293].

See the article record from the source Czech Digital Mathematics Library

Finite mixture modelling of class-conditional distributions is a standard method in statistical pattern recognition. Using the bag-of-words vector representation of documents, this paper explores the mixture of multinomial distributions as a model of the class-conditional distribution for the multiclass text document classification task. The proposed model was compared experimentally with the standard Bernoulli and multinomial models, as well as with a model based on a mixture of multivariate Bernoulli distributions, on the Reuters-21578 and Newsgroups data sets. Preliminary experimental results indicate the effectiveness of the proposed model for text classification.
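The abstract names the model but this record does not give its formulas, so the following is only a minimal sketch of how a mixture of multinomials could serve as a class-conditional distribution in a Bayes classifier over bag-of-words counts. The function names, the toy vocabulary of three words, and all parameter values are hypothetical; the paper's actual parameter estimation procedure is not shown here, and the parameters are assumed to be already estimated and strictly positive.

```python
import math

def log_multinomial(counts, word_probs):
    """Log-probability of a word-count vector under one multinomial.
    The multinomial coefficient is dropped: it depends only on the
    document, so it does not change which class scores highest."""
    return sum(n * math.log(p) for n, p in zip(counts, word_probs) if n > 0)

def log_mixture(counts, weights, components):
    """Log of sum_m weights[m] * P(counts | components[m]),
    computed with the log-sum-exp trick for numerical stability."""
    terms = [math.log(w) + log_multinomial(counts, theta)
             for w, theta in zip(weights, components)]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))

def classify(counts, priors, class_models):
    """Return the class with the largest log prior plus
    log class-conditional mixture likelihood."""
    scores = {c: math.log(priors[c]) + log_mixture(counts, *class_models[c])
              for c in priors}
    return max(scores, key=scores.get)

# Hypothetical toy parameters: two classes, two mixture components each,
# over a three-word vocabulary. Each entry is (weights, components).
priors = {"sports": 0.5, "finance": 0.5}
class_models = {
    "sports": ([0.5, 0.5], [[0.7, 0.2, 0.1], [0.6, 0.3, 0.1]]),
    "finance": ([0.4, 0.6], [[0.1, 0.2, 0.7], [0.1, 0.3, 0.6]]),
}

print(classify([3, 1, 0], priors, class_models))
print(classify([0, 1, 4], priors, class_models))
```

With a single component of weight 1, `log_mixture` reduces to the plain multinomial model that the abstract lists as a baseline.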
Classification: 62G05, 62H30, 68T10
Keywords: text classification; multinomial mixture model
@article{KYB_2004__40_3_a2,
     author = {Novovi\v{c}ov\'a, Jana and Mal{\'\i}k, Anton{\'\i}n},
     title = {Text document classification based on mixture models},
     journal = {Kybernetika},
     pages = {[293]},
     publisher = {mathdoc},
     volume = {40},
     number = {3},
     year = {2004},
     mrnumber = {2103933},
     zbl = {1248.62107},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a2/}
}
TY  - JOUR
AU  - Novovičová, Jana
AU  - Malík, Antonín
TI  - Text document classification based on mixture models
JO  - Kybernetika
PY  - 2004
SP  - [293]
VL  - 40
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a2/
LA  - en
ID  - KYB_2004__40_3_a2
ER  - 
%0 Journal Article
%A Novovičová, Jana
%A Malík, Antonín
%T Text document classification based on mixture models
%J Kybernetika
%D 2004
%P [293]
%V 40
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a2/
%G en
%F KYB_2004__40_3_a2
Novovičová, Jana; Malík, Antonín. Text document classification based on mixture models. Kybernetika, Volume 40 (2004) no. 3, p. [293]. http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a2/