Exploring Instances for Matching Heterogeneous Database Schemas Utilizing Google Similarity and Regular Expression
Computer Science and Information Systems, Volume 15 (2018) no. 2.

See the article record from the source Computer Science and Information Systems website

Instance-based schema matching aims to identify correspondences between attributes of different schemas. Several approaches have been proposed to discover these correspondences, but they treat all instances, including those with numeric values, as strings. This prevents discovering common patterns or performing statistical computation over numeric instances. Consequently, matches between numeric instances go unidentified, which in turn affects the overall results. In this paper, we propose an approach for finding matches between semantically and syntactically related attributes of schemas. Since we exploit only the instances of the schemas, we rely on strategies that combine the strength of Google as a web semantic resource with that of regular expressions for pattern recognition. To demonstrate the accuracy of our approach, we conducted an experimental evaluation using real-world datasets. The results show that our approach finds 1-1 matches with high accuracy, in the range of 93% to 99%. Furthermore, our proposed approach outperformed previous approaches when using a sample of instances.
Keywords: schema matching, instance based schema matching, Google similarity, regular expression
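The abstract names the two ingredients but does not give their formulas. The following is only a rough sketch of how such signals could be computed, under stated assumptions: that "Google similarity" is in the spirit of the Normalized Google Distance of Cilibrasi and Vitányi, computed from page counts supplied by the caller, and that the regular-expression step reduces each instance to a coarse shape before comparing columns. The function names `ngd`, `shape`, and `pattern_overlap` are hypothetical and are not taken from the paper.

```python
import math
import re
from collections import Counter


def ngd(fx, fy, fxy, n):
    """Normalized Google Distance from page counts.

    fx, fy: hits for each term alone; fxy: hits for both together;
    n: assumed index size. All four are caller-supplied (hypothetical inputs).
    """
    if min(fx, fy, fxy) <= 0:
        return float("inf")  # no co-occurrence evidence at all
    lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(lx, ly) - lxy) / (math.log(n) - min(lx, ly))


def shape(value):
    """Collapse an instance to a coarse pattern: digit runs -> 9, letter runs -> A."""
    collapsed = re.sub(r"\d+", "9", value.strip())
    return re.sub(r"[A-Za-z]+", "A", collapsed)


def pattern_overlap(col_a, col_b):
    """Multiset overlap of instance shapes, normalized by the larger column size."""
    a = Counter(shape(v) for v in col_a)
    b = Counter(shape(v) for v in col_b)
    shared = sum(min(a[k], b[k]) for k in a.keys() & b.keys())
    return shared / max(len(col_a), len(col_b), 1)
```

In this sketch a low `ngd` value would suggest semantically related terms, and a `pattern_overlap` near 1 would suggest two columns whose instances share the same syntactic form, for example two phone-number attributes. How the paper actually combines or thresholds these signals is not stated in the abstract.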
@article{CSIS_2018_15_2_a3,
     author = {Osama A. Mehdi and Hamidah Ibrahim and Lilly Suriani Affendey and Eric Pardede and Jinli Cao},
     title = {Exploring {Instances} for {Matching} {Heterogeneous} {Database} {Schemas} {Utilizing} {Google} {Similarity} and {Regular} {Expression}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {15},
     number = {2},
     year = {2018},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2018_15_2_a3/}
}
TY  - JOUR
AU  - Osama A. Mehdi
AU  - Hamidah Ibrahim
AU  - Lilly Suriani Affendey
AU  - Eric Pardede
AU  - Jinli Cao
TI  - Exploring Instances for Matching Heterogeneous Database Schemas Utilizing Google Similarity and Regular Expression
JO  - Computer Science and Information Systems
PY  - 2018
VL  - 15
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2018_15_2_a3/
ID  - CSIS_2018_15_2_a3
ER  - 
%0 Journal Article
%A Osama A. Mehdi
%A Hamidah Ibrahim
%A Lilly Suriani Affendey
%A Eric Pardede
%A Jinli Cao
%T Exploring Instances for Matching Heterogeneous Database Schemas Utilizing Google Similarity and Regular Expression
%J Computer Science and Information Systems
%D 2018
%V 15
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2018_15_2_a3/
%F CSIS_2018_15_2_a3
Osama A. Mehdi; Hamidah Ibrahim; Lilly Suriani Affendey; Eric Pardede; Jinli Cao. Exploring Instances for Matching Heterogeneous Database Schemas Utilizing Google Similarity and Regular Expression. Computer Science and Information Systems, Volume 15 (2018) no. 2. http://geodesic.mathdoc.fr/item/CSIS_2018_15_2_a3/