Hausdorff Distances for Searching in Binary Text Images
Serdica Journal of Computing, Tome 3 (2009) no. 1, pp. 23-46
Cet article a éte moissonné depuis la source Bulgarian Digital Mathematics Library
Hausdorff distance (HD) seems the most efficient instrument
for measuring how far two compact non-empty subsets of a metric space are
from each other. This paper considers the possibilities provided by HD and
some of its modifications used recently by many authors for resemblance
between binary text images. Summarizing part of the existing word image
matching methods, relied on HD, we investigate a new similar parameterized
method which contains almost all of them as particular cases. Numerical
experiments for searching words in binary text images are carried out with
333 pages of old Bulgarian typewritten text, 200 printed pages of Bulgarian
Chrestomathy from year 1884, and 200 handwritten pages of Slavonic manuscript
from year 1574. They outline how the parameters must be set in order
to use the advantages of the proposed method for the purposes of word
matching in scanned document images.
Keywords:
Hausdorff Distance, Binary Text Image, Word Matching
@article{SJC_2009_3_1_a3,
author = {Andreev, Andrey and Kirov, Nikolay},
title = {Hausdorff {Distances} for {Searching} in {Binary} {Text} {Images}},
journal = {Serdica Journal of Computing},
pages = {23--46},
year = {2009},
volume = {3},
number = {1},
language = {en},
url = {http://geodesic.mathdoc.fr/item/SJC_2009_3_1_a3/}
}
Andreev, Andrey; Kirov, Nikolay. Hausdorff Distances for Searching in Binary Text Images. Serdica Journal of Computing, Tome 3 (2009) no. 1, pp. 23-46. http://geodesic.mathdoc.fr/item/SJC_2009_3_1_a3/