Using Part-of-Speech Tags as Deep-Syntax Indicators in Determining Short-Text Semantic Similarity
Computer Science and Information Systems, Volume 12 (2015), no. 1.

See the article record from the source: Computer Science and Information Systems website

This paper presents POST STSS, a method of determining short-text semantic similarity in which part-of-speech tags are used as indicators of the deeper syntactic information usually extracted by more advanced tools like parsers and semantic role labelers. Our model employs a part-of-speech weighting scheme and is based on a statistical bag-of-words approach. It does not require either hand-crafted knowledge bases or advanced syntactic tools, which makes it easily applicable to languages with limited natural language processing resources. By using a paraphrase recognition test, we demonstrate that our system achieves a higher accuracy than all existing statistical similarity algorithms and solutions of a more structural kind.
Keywords: short-text semantic similarity, statistical similarity, corpus-based measures, part-of-speech tags, POS weighting, syntactic information, bag-of-words model, natural language processing
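To make the general idea of part-of-speech weighting in a bag-of-words model concrete, here is a minimal sketch. It is not the POST STSS algorithm itself, whose details are not given in this record: the weight table `POS_WEIGHTS`, the fallback `DEFAULT_WEIGHT`, the tag names, and the function names are all hypothetical, and plain cosine similarity stands in for whatever similarity measure the paper actually uses. The input is assumed to be already tagged, as lists of `(token, tag)` pairs.

```python
import math
from collections import defaultdict

# Hypothetical weights chosen for illustration only; they are NOT taken from the paper.
POS_WEIGHTS = {"NOUN": 1.0, "VERB": 0.8, "ADJ": 0.6, "ADV": 0.4}
DEFAULT_WEIGHT = 0.1  # assumed weight for every other tag (determiners, prepositions, ...)


def weighted_bag(tagged_tokens):
    """Build a bag of words in which each occurrence counts by its POS weight."""
    bag = defaultdict(float)
    for token, tag in tagged_tokens:
        bag[token.lower()] += POS_WEIGHTS.get(tag, DEFAULT_WEIGHT)
    return dict(bag)


def pos_weighted_cosine(tagged_a, tagged_b):
    """Cosine similarity between two POS-weighted bags of words."""
    bag_a = weighted_bag(tagged_a)
    bag_b = weighted_bag(tagged_b)
    dot = sum(weight * bag_b.get(token, 0.0) for token, weight in bag_a.items())
    norm_a = math.sqrt(sum(weight * weight for weight in bag_a.values()))
    norm_b = math.sqrt(sum(weight * weight for weight in bag_b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    first = [("the", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")]
    second = [("a", "DET"), ("cat", "NOUN"), ("sleeps", "VERB")]
    third = [("the", "DET"), ("dog", "NOUN"), ("barks", "VERB")]
    # Shared content words should count far more than a shared determiner.
    print(pos_weighted_cosine(first, second))
    print(pos_weighted_cosine(first, third))
```

In this sketch, two texts that share a noun and a verb score much higher than two texts that share only a determiner, which is the kind of effect a part-of-speech weighting scheme is meant to produce without a parser or a hand-crafted knowledge base.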
@article{CSIS_2015_12_1_a1,
     author = {Vuk Batanovi\'c and Dragan Boji\'c},
     title = {Using {Part-of-Speech} {Tags} as {Deep-Syntax} {Indicators} in {Determining} {Short-Text} {Semantic} {Similarity}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {12},
     number = {1},
     year = {2015},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2015_12_1_a1/}
}
Vuk Batanović; Dragan Bojić. Using Part-of-Speech Tags as Deep-Syntax Indicators in Determining Short-Text Semantic Similarity. Computer Science and Information Systems, Volume 12 (2015), no. 1. http://geodesic.mathdoc.fr/item/CSIS_2015_12_1_a1/