Semi-automatic generation of linear event extraction patterns for free texts
Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Vol. 155 (2013) no. 4, pp. 99-108
View the book chapter record from the source Math-Net.Ru
In this paper we describe a semi-automatic approach to generating event extraction patterns for free texts. The algorithm consists of four steps: we automatically extract candidate events from a corpus of free-text documents, cluster them using dependency-based parse tree paths, validate random samples from each cluster, and generate linear patterns from the positive event clusters. We compare the approach with a system that uses handcrafted patterns.
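The four steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the candidate events, the dependency path strings, the `is_event` labels, the sampling threshold, and every function name below are hypothetical. Step 1 (automatic candidate extraction with a dependency parser) is assumed to have already produced the toy data, the `oracle` callable stands in for manual validation, and plain regular expressions stand in for TextMARKER/RUTA rules.

```python
import random
import re
from collections import defaultdict

# Step 1 (assumed done): candidate events with a precomputed dependency path.
# The path strings and the is_event labels are invented for illustration.
candidates = [
    {"tokens": ["Acme", "acquired", "Bolt"], "path": "nsubj-acquire-dobj", "is_event": True},
    {"tokens": ["Initech", "acquired", "Hooli"], "path": "nsubj-acquire-dobj", "is_event": True},
    {"tokens": ["Prices", "rose", "sharply"], "path": "nsubj-rise-advmod", "is_event": False},
]


def cluster_by_path(items):
    """Step 2: group candidates that share the same dependency path."""
    clusters = defaultdict(list)
    for item in items:
        clusters[item["path"]].append(item)
    return dict(clusters)


def validate_cluster(cluster, oracle, sample_size=2, threshold=0.5, rng=None):
    """Step 3: judge a random sample; accept the cluster if enough are events."""
    rng = rng or random.Random(0)
    sample = rng.sample(cluster, min(sample_size, len(cluster)))
    positives = sum(1 for item in sample if oracle(item))
    return positives / len(sample) >= threshold


def linear_pattern(token_lists):
    """Step 4: keep tokens shared at a position, generalize the rest to a wildcard."""
    length = len(token_lists[0])
    if any(len(tokens) != length for tokens in token_lists):
        raise ValueError("this toy generalizer expects equal-length token sequences")
    slots = []
    for position in zip(*token_lists):
        if len(set(position)) == 1:
            slots.append(re.escape(position[0]))
        else:
            slots.append(r"\w+")
    return r"\s+".join(slots)


def generate_patterns(items, oracle):
    """Run steps 2 to 4 and return one pattern per accepted cluster."""
    patterns = {}
    for path, cluster in cluster_by_path(items).items():
        if validate_cluster(cluster, oracle):
            patterns[path] = linear_pattern([item["tokens"] for item in cluster])
    return patterns


# The lambda plays the role of the human validator in this sketch.
patterns = generate_patterns(candidates, oracle=lambda item: item["is_event"])
```

In this toy run only the acquisition cluster passes validation, and its pattern generalizes the two company names to wildcards while keeping the shared trigger word, so it also matches an unseen sentence such as "Globex acquired Umbrella".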
Keywords:
event extraction, linear patterns, regular expressions, TextMARKER
Keywords: RUTA.
@article{UZKU_2013_155_4_a9,
author = {D. Dzendzik and S. Serebryakov},
title = {Semi-automatic generation of linear event extraction patterns for free texts},
journal = {U\v{c}\"enye zapiski Kazanskogo universiteta. Seri\^a Fiziko-matemati\v{c}eskie nauki},
pages = {99--108},
publisher = {mathdoc},
volume = {155},
number = {4},
year = {2013},
language = {en},
url = {http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/}
}
TY  - JOUR
AU  - D. Dzendzik
AU  - S. Serebryakov
TI  - Semi-automatic generation of linear event extraction patterns for free texts
JO  - Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
PY  - 2013
SP  - 99
EP  - 108
VL  - 155
IS  - 4
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/
LA  - en
ID  - UZKU_2013_155_4_a9
ER  -
%0 Journal Article
%A D. Dzendzik
%A S. Serebryakov
%T Semi-automatic generation of linear event extraction patterns for free texts
%J Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki
%D 2013
%P 99-108
%V 155
%N 4
%I mathdoc
%U http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/
%G en
%F UZKU_2013_155_4_a9
D. Dzendzik; S. Serebryakov. Semi-automatic generation of linear event extraction patterns for free texts. Učënye zapiski Kazanskogo universiteta. Seriâ Fiziko-matematičeskie nauki, Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, Vol. 155 (2013) no. 4, pp. 99-108. http://geodesic.mathdoc.fr/item/UZKU_2013_155_4_a9/