Semi-automatic generation of linear event extraction patterns for free texts
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 155 (2013) no. 4, pp. 99-108

Voir la notice du chapitre de livre provenant de la source Math-Net.Ru

In this paper we describe semi-automatic approach to generating event extraction patterns for free texts. The algorithm is composed of four steps: we automatically extract possible events from a corpus of free documents, cluster them using dependency-based parse tree paths, validate random samples from each cluster and generate linear patterns using positive event clusters. We compare it with the system that uses handcrafted patterns.
Keywords: event extraction, linear patterns, regular expressions, TextMARKER
Mots-clés : RUTA.
@article{UZKU_2013_155_4_a9,
     author = {D. Dzendzik and S. Serebryakov},
     title = {Semi-automatic generation of linear event extraction patterns for free texts},
     journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
     pages = {99--108},
     publisher = {mathdoc},
     volume = {155},
     number = {4},
     year = {2013},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/}
}
TY  - JOUR
AU  - D. Dzendzik
AU  - S. Serebryakov
TI  - Semi-automatic generation of linear event extraction patterns for free texts
JO  - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
PY  - 2013
SP  - 99
EP  - 108
VL  - 155
IS  - 4
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/
LA  - en
ID  - UZKU_2013_155_4_a9
ER  - 
%0 Journal Article
%A D. Dzendzik
%A S. Serebryakov
%T Semi-automatic generation of linear event extraction patterns for free texts
%J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
%D 2013
%P 99-108
%V 155
%N 4
%I mathdoc
%U http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/
%G en
%F UZKU_2013_155_4_a9
D. Dzendzik; S. Serebryakov. Semi-automatic generation of linear event extraction patterns for free texts. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Tome 155 (2013) no. 4, pp. 99-108. http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/