Dynamic programming approach to textual structured objects segmentation in images
Informacionnye tehnologii i vyčislitelnye sistemy, no. 3 (2019), pp. 66-78.

See the article record from the source Math-Net.Ru

This paper addresses the segmentation of images of text fragments under known constraints on the relative positions of their elements. The model in which the constraints form a path graph is considered. It is shown that in this case the segmentation problem can be solved exactly with a dynamic programming algorithm, and that this algorithm has optimal asymptotic complexity. The algorithm was integrated into two recognition systems. The first system recognizes identity documents, such as passports and driver's licenses, and uses the proposed algorithm to extract information fields. For this purpose a two-level field hierarchy was introduced: fields are grouped into rows and ordered from left to right within each row, and the rows themselves are ordered from top to bottom. The second system recognizes license plates and uses the proposed algorithm to segment a plate into individual characters, relying on the natural left-to-right ordering of characters. Together, the two applications demonstrate the generality of the proposed approach. Experiments were conducted on a closed data set to measure the quality of the resulting solutions and their performance on a mobile phone. The results show that the solutions are superior in quality to algorithms that do not use constraints on the mutual arrangement of elements, and that their computational cost allows them to run on mobile devices in real time.
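To illustrate the kind of computation the abstract describes, the following is a minimal sketch of dynamic programming over a path (chain) of elements. It is not the authors' algorithm: it is a generic Viterbi-style recursion with cost O(n·m²) for n elements and m candidate positions, and it makes no claim to the optimal complexity stated above. The names `segment_chain`, `unary`, and `ordered_gap`, the cost values, and the expected gap of 2 are all hypothetical and chosen only for the example.

```python
from math import inf
from typing import Callable, List, Sequence, Tuple


def segment_chain(
    unary: Sequence[Sequence[float]],
    pairwise: Callable[[int, int], float],
) -> Tuple[float, List[int]]:
    """Minimise sum_i unary[i][x_i] + sum_i pairwise(x_i, x_{i+1}) exactly."""
    n, m = len(unary), len(unary[0])
    # best[x]: cheapest cost of placing elements 0..i with element i at position x
    best = list(unary[0])
    # back[i - 1][x]: best position of element i-1 given element i at position x
    back: List[List[int]] = []
    for i in range(1, n):
        new_best, choices = [], []
        for x in range(m):
            prev = min(range(m), key=lambda p: best[p] + pairwise(p, x))
            new_best.append(best[prev] + pairwise(prev, x) + unary[i][x])
            choices.append(prev)
        best = new_best
        back.append(choices)
    # Pick the cheapest end position, then follow the back-pointers.
    last = min(range(m), key=lambda x: best[x])
    total = best[last]
    path = [last]
    for choices in reversed(back):
        path.append(choices[path[-1]])
    path.reverse()
    return total, path


def ordered_gap(left: int, right: int) -> float:
    """Hypothetical constraint: strict left-to-right order, expected gap of 2."""
    if right <= left:
        return inf  # forbidden: elements out of order
    return abs((right - left) - 2)


# Hypothetical per-position costs for 3 elements over 6 candidate positions.
unary = [
    [5, 0, 5, 5, 5, 5],  # element 0 fits best at position 1
    [0, 5, 5, 1, 5, 5],  # element 1 fits best at 0, which would break the order
    [5, 5, 5, 5, 5, 0],  # element 2 fits best at position 5
]
total, path = segment_chain(unary, ordered_gap)
print(total, path)  # 1 [1, 3, 5]
```

Choosing each position independently would give `[1, 0, 5]`, which violates the left-to-right order; the chain recursion instead returns the cheapest placement that satisfies the constraint.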
Keywords: text segmentation, OCR, dynamic programming, document recognition, image processing
@article{ITVS_2019_3_a5,
     author = {M. A. Povolotskiy and D. V. Tropin and T. S. Chernov and B. I. Savelyev},
     title = {Dynamic programming approach to textual structured objects segmentation in images},
     journal = {Informacionnye tehnologii i vy\v{c}islitelnye sistemy},
     pages = {66--78},
     publisher = {mathdoc},
     number = {3},
     year = {2019},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/ITVS_2019_3_a5/}
}
TY  - JOUR
AU  - M. A. Povolotskiy
AU  - D. V. Tropin
AU  - T. S. Chernov
AU  - B. I. Savelyev
TI  - Dynamic programming approach to textual structured objects segmentation in images
JO  - Informacionnye tehnologii i vyčislitelnye sistemy
PY  - 2019
SP  - 66
EP  - 78
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/ITVS_2019_3_a5/
LA  - ru
ID  - ITVS_2019_3_a5
ER  - 
%0 Journal Article
%A M. A. Povolotskiy
%A D. V. Tropin
%A T. S. Chernov
%A B. I. Savelyev
%T Dynamic programming approach to textual structured objects segmentation in images
%J Informacionnye tehnologii i vyčislitelnye sistemy
%D 2019
%P 66-78
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/ITVS_2019_3_a5/
%G ru
%F ITVS_2019_3_a5
M. A. Povolotskiy; D. V. Tropin; T. S. Chernov; B. I. Savelyev. Dynamic programming approach to textual structured objects segmentation in images. Informacionnye tehnologii i vyčislitelnye sistemy, no. 3 (2019), pp. 66-78. http://geodesic.mathdoc.fr/item/ITVS_2019_3_a5/