A partial parser with heuristics reducing the number of false chunks in the russian clause
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Kazanskii Gosudarstvennyi Universitet. Uchenye Zapiski. Seriya Fiziko-Matematichaskie Nauki, Tome 151 (2009) no. 3, pp. 214-228 Cet article a éte moissonné depuis la source Math-Net.Ru

Voir la notice du chapitre de livre

The problem of partial parsing is considered in this paper. New heuristics are proposed to reduce the quantity of chunks falsely exposed at the first step of analysis. A very large influence is rendered by the phenomena of homonymy and polysemy on detection of chunks in russian. Falsely exposed chunks are treated as ones which were found out by a partial parser, but are not actually correct. The method of search of chunks with the use of these heuristics got the name “Right-chunk 4”. The formal task statement is carried out. Computer realization of method of search of chunks is executed as software “Chunk-creator 4”. The estimation of quality is conducted.
Keywords: artificial intelligence, computational linguistics, parsing, chunking.
@article{UZKU_2009_151_3_a18,
     author = {V. A. Bushtedt and V. N. Polyakov},
     title = {A partial parser with heuristics reducing the number of false chunks in the russian clause},
     journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
     pages = {214--228},
     year = {2009},
     volume = {151},
     number = {3},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/UZKU_2009_151_3_a18/}
}
TY  - JOUR
AU  - V. A. Bushtedt
AU  - V. N. Polyakov
TI  - A partial parser with heuristics reducing the number of false chunks in the russian clause
JO  - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
PY  - 2009
SP  - 214
EP  - 228
VL  - 151
IS  - 3
UR  - http://geodesic.mathdoc.fr/item/UZKU_2009_151_3_a18/
LA  - ru
ID  - UZKU_2009_151_3_a18
ER  - 
%0 Journal Article
%A V. A. Bushtedt
%A V. N. Polyakov
%T A partial parser with heuristics reducing the number of false chunks in the russian clause
%J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
%D 2009
%P 214-228
%V 151
%N 3
%U http://geodesic.mathdoc.fr/item/UZKU_2009_151_3_a18/
%G ru
%F UZKU_2009_151_3_a18
V. A. Bushtedt; V. N. Polyakov. A partial parser with heuristics reducing the number of false chunks in the russian clause. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Kazanskii Gosudarstvennyi Universitet. Uchenye Zapiski. Seriya Fiziko-Matematichaskie Nauki, Tome 151 (2009) no. 3, pp. 214-228. http://geodesic.mathdoc.fr/item/UZKU_2009_151_3_a18/

[1] Popov E. V., Obschenie s EVM na estestvennom yazyke, Editorial URSS, M., 2004, 360 pp.

[2] Smirnov Yu. M., Andreev A. M., Berezkin D. V., Brik A. V., “Ob odnom sposobe postroeniya sintaksicheskogo analizatora tekstov na estestvennom yazyke”, Izv. vuzov. Priborostroenie, 40:5 (1997), 34–42

[3] Ermakov A. E., “Nepolnyi sintaksicheskii analiz teksta v informatsionno-poiskovykh sistemakh”, Kompyuternaya lingvistika i intellektualnye tekhnologii, Trudy Mezhdunar. seminara Dialog'2002, v 2 t., v. 2, Nauka, M., 2002, 180–185

[4] Ermakov A. E., “Tematicheskii analiz teksta s vyyavleniem sverkhfrazovoi struktury”, Inform. tekhnol., 2000, no. 11, 37–40

[5] Ermakov A. E., Pleshko V. V., “Assotsiativnaya model porozhdeniya teksta v zadache klassifikatsii”, Inform. tekhnol., 2000, no. 12, 34–37

[6] Andreev A. M., Berezkin D. V., Brik A. V., Kantonistov Yu. A., “Veroyatnostnyi sintaksicheskii analizator dlya informatsionno-poiskovoi sistemy”, Kompyuternaya khronika, 1999, no. 1, 3–4

[7] Bushtedt V. A., Polyakov V. N., “Chastichnyi sintaksicheskii analizator dlya korporativnoi poiskovoi sistemy”, Trudy Kazan. shkoly po kompyuternoi i kognitivnoi lingvistike, TEL-2006, Otechestvo, Kazan, 2007, 4–15

[8] Bushtedt V., Polyakov V., “Finding chunks with restricrion of distance to dependent word”, Kognitivnoe modelirovanie v lingvistike, Trudy IX mezhdunar. konf., Sofia, Bulgaria, 2007, 38–47

[9] Kuzmin Yu. G., Polyakov V. N., Shmagina E. V., “Metod leksiko-sintaksicheskikh portretov i zadacha razresheniya leksicheskoi mnogoznachnosti”, Trudy Kazan. shkoly po kompyuternoi i kognitivnoi lingvistike, TEL-2006, Otechestvo, Kazan, 2007, 139–147

[10] Hall K., Novak V., “Corrective modeling for non-projective dependency parsing”, Proceedings of the 9th International Workshop on Parsing Technologies, IWPT, 2005, 42–52 | DOI

[11] D. E. Rozental (red.), Sovremennyi russkii yazyk: Leksika i frazeologiya. Fonetika i orfoepiya. Grafika i orfografiya. Slovoobrazovanie. Morfologiya. Sintaksis, Uchebnik dlya vuzov, Vyssh. shk., M., 1984, 735 pp.