Applying A Normalized Compression Metric To The Measurement Of Dialect Distance
Serdica Journal of Computing, Tome 1 (2007) no. 1, pp. 73-86
Voir la notice de l'article provenant de la source Bulgarian Digital Mathematics Library
The paper discusses the application of a similarity metric based
on compression to the measurement of the distance among Bulgarian dia-
lects. The similarity metric is de ned on the basis of the notion of Kolmo-
gorov complexity of a le (or binary string). The application of Kolmogorov
complexity in practice is not possible because its calculation over a le is an
undecidable problem. Thus, the actual similarity metric is based on a real life
compressor which only approximates the Kolmogorov complexity. To use the
metric for distance measurement of Bulgarian dialects we rst represent the
dialectological data in such a way that the metric is applicable. We propose
two such representations which are compared to a baseline distance between
dialects. Then we conclude the paper with an outline of our future work.
Keywords:
Kolmogorov Complexity, Compression Metric, Dialect Distance, Language Contacts
@article{SJC_2007_1_1_a6,
author = {Simov, Kiril and Osenova, Petya},
title = {Applying {A} {Normalized} {Compression} {Metric} {To} {The} {Measurement} {Of} {Dialect} {Distance}},
journal = {Serdica Journal of Computing},
pages = {73--86},
publisher = {mathdoc},
volume = {1},
number = {1},
year = {2007},
language = {en},
url = {http://geodesic.mathdoc.fr/item/SJC_2007_1_1_a6/}
}
TY - JOUR AU - Simov, Kiril AU - Osenova, Petya TI - Applying A Normalized Compression Metric To The Measurement Of Dialect Distance JO - Serdica Journal of Computing PY - 2007 SP - 73 EP - 86 VL - 1 IS - 1 PB - mathdoc UR - http://geodesic.mathdoc.fr/item/SJC_2007_1_1_a6/ LA - en ID - SJC_2007_1_1_a6 ER -
Simov, Kiril; Osenova, Petya. Applying A Normalized Compression Metric To The Measurement Of Dialect Distance. Serdica Journal of Computing, Tome 1 (2007) no. 1, pp. 73-86. http://geodesic.mathdoc.fr/item/SJC_2007_1_1_a6/