Multidimensional term indexing for efficient processing of complex queries
Kybernetika, Tome 40 (2004) no. 3, p. [381].

Voir la notice de l'article provenant de la source Czech Digital Mathematics Library

The area of Information Retrieval deals with problems of storage and retrieval within a huge collection of text documents. In IR models, the semantics of a document is usually characterized using a set of terms. A common need to various IR models is an efficient term retrieval provided via a term index. Existing approaches of term indexing, e. g. the inverted list, support efficiently only simple queries asking for a term occurrence. In practice, we would like to exploit some more sophisticated querying mechanisms, in particular queries based on regular expressions. In this article we propose a multidimensional approach of term indexing providing efficient term retrieval and supporting regular expression queries. Since the term lengths are usually different, we also introduce an improvement based on a new data structure, called BUB-forest, providing even more efficient term retrieval.
Classification : 14Q15, 68P05, 68P10, 68P20
Keywords: term indexing; complex queries; multidimensional data structures; BUB-forest
@article{KYB_2004__40_3_a8,
     author = {Kr\'atk\'y, Michal and Skopal, Tom\'a\v{s} and Sn\'a\v{s}el, V\'aclav},
     title = {Multidimensional term indexing for efficient processing of complex queries},
     journal = {Kybernetika},
     pages = {[381]},
     publisher = {mathdoc},
     volume = {40},
     number = {3},
     year = {2004},
     zbl = {1249.68042},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a8/}
}
TY  - JOUR
AU  - Krátký, Michal
AU  - Skopal, Tomáš
AU  - Snášel, Václav
TI  - Multidimensional term indexing for efficient processing of complex queries
JO  - Kybernetika
PY  - 2004
SP  - [381]
VL  - 40
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a8/
LA  - en
ID  - KYB_2004__40_3_a8
ER  - 
%0 Journal Article
%A Krátký, Michal
%A Skopal, Tomáš
%A Snášel, Václav
%T Multidimensional term indexing for efficient processing of complex queries
%J Kybernetika
%D 2004
%P [381]
%V 40
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a8/
%G en
%F KYB_2004__40_3_a8
Krátký, Michal; Skopal, Tomáš; Snášel, Václav. Multidimensional term indexing for efficient processing of complex queries. Kybernetika, Tome 40 (2004) no. 3, p. [381]. http://geodesic.mathdoc.fr/item/KYB_2004__40_3_a8/