Automatic term extraction based on feature combination
Numerical methods and programming, Tome 11 (2010) no. 4, pp. 108-116
Cet article a éte moissonné depuis la source Math-Net.Ru
The paper describes the method of extraction of two-word domain terms combining their features. The features are computed from three sources: the word usage statistics in a domain-specific text collection, the statistics of global search engines, and a domain-specific thesaurus. The evaluation of the approach is based on the terminology from Ontology on natural sciences and technology. We show that the use of multiple features considerably improves the automatic extraction of domain-specific terms.
Keywords:
knowledge acquisition; term extraction; thesaurus; machine learning; search engine; Internet.
@article{VMP_2010_11_4_a13,
author = {N. V. Lukashevich and Yu. M. Logachev},
title = {Automatic term extraction based on feature combination},
journal = {Numerical methods and programming},
pages = {108--116},
year = {2010},
volume = {11},
number = {4},
language = {ru},
url = {http://geodesic.mathdoc.fr/item/VMP_2010_11_4_a13/}
}
N. V. Lukashevich; Yu. M. Logachev. Automatic term extraction based on feature combination. Numerical methods and programming, Tome 11 (2010) no. 4, pp. 108-116. http://geodesic.mathdoc.fr/item/VMP_2010_11_4_a13/