Research on Discovering Deep Web Entries
Computer Science and Information Systems, Volume 8 (2011) no. 3.

See the article record from the source: Computer Science and Information Systems website

Ontology plays an important role in locating domain-specific Deep Web content. This paper presents WFF, a novel framework for efficiently locating domain-specific Deep Web databases. WFF combines focused crawling with an ontology and arranges three classifiers hierarchically: a Web Page Classifier (WPC), a Form Structure Classifier (FSC), and a Form Content Classifier (FCC). First, the WPC uses an ontology-assisted focused crawler to discover potentially interesting pages. Next, the FSC analyzes these pages and determines, from structural characteristics, whether they contain searchable forms. Finally, the FCC identifies, at the semantic level, the searchable forms that belong to a given domain and stores the URLs of these forms in a database. A detailed experimental evaluation shows that the WFF framework not only simplifies the discovery process but also effectively identifies domain-specific databases.
Keywords: Deep Web, ontology, WPC, FSC, FCC
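
The three-stage cascade described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the classifiers' features or models, so every rule below is an assumption made for illustration. The "ontology" is reduced to a flat set of terms (BOOK_TERMS), the WPC to a term-overlap threshold, the FSC to a check for a free-text field with no password field, and the FCC to a match between form words and ontology terms. All function and variable names are hypothetical.

```python
import re
from html.parser import HTMLParser

# Assumption: a flat term set stands in for a real domain ontology.
BOOK_TERMS = {"book", "author", "title", "isbn", "publisher"}


def tokenize(text):
    """Lowercase alphabetic tokens of a string."""
    return re.findall(r"[a-z]+", (text or "").lower())


class PageParser(HTMLParser):
    """Collects page words and, per <form>, its field types and words."""

    def __init__(self):
        super().__init__()
        self.page_words = set()
        self.forms = []
        self._form = None  # form currently being parsed, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._form = {"types": [], "words": set()}
            self.forms.append(self._form)
        elif self._form is not None and tag in ("input", "select", "textarea"):
            kind = (attrs.get("type") or "text").lower() if tag == "input" else tag
            self._form["types"].append(kind)
            self._form["words"].update(tokenize(attrs.get("name")))

    def handle_endtag(self, tag):
        if tag == "form":
            self._form = None

    def handle_data(self, data):
        words = tokenize(data)
        self.page_words.update(words)
        if self._form is not None:
            self._form["words"].update(words)


def wpc(page_words, terms, threshold=2):
    """Stage 1 (assumed rule): page mentions enough ontology terms."""
    return len(page_words & terms) >= threshold


def fsc(form):
    """Stage 2 (assumed rule): a free-text field and no password field."""
    return "text" in form["types"] and "password" not in form["types"]


def fcc(form, terms):
    """Stage 3 (assumed rule): form labels or field names hit the ontology."""
    return bool(form["words"] & terms)


def find_domain_forms(pages, terms):
    """Return URLs of pages holding a form that passes all three stages."""
    hits = []
    for url, html in pages.items():
        parser = PageParser()
        parser.feed(html)
        parser.close()
        if not wpc(parser.page_words, terms):
            continue  # later stages never see off-topic pages
        if any(fsc(f) and fcc(f, terms) for f in parser.forms):
            hits.append(url)
    return hits


# Toy pages (invented for this example), one per outcome of the cascade.
PAGES = {
    "http://example.org/search": '<h1>Book search</h1><form><label>Author</label>'
                                 '<input type="text" name="author"><input type="submit"></form>',
    "http://example.org/login": '<p>Book author title</p><form><input type="text" name="user">'
                                '<input type="password" name="pw"></form>',
    "http://example.org/news": '<p>New book by a known author</p><form>Subscribe'
                               '<input type="text" name="email"></form>',
    "http://example.org/weather": '<h1>Weather</h1><form><input type="text" name="city"></form>',
}

RESULT = find_domain_forms(PAGES, BOOK_TERMS)
```

Ordering the stages from cheapest to most specific means each later classifier only examines what the earlier one accepted, which is the simplification the hierarchical arrangement aims for. In this toy run only the search page survives all three stages: the login form fails the structural check, the newsletter form fails the content check, and the weather page is rejected before its form is inspected.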
@article{CSIS_2011_8_3_a13,
     author = {Ying Wang and Huilai Li and Wanli Zuo and Fengling He and Xin Wang and Kerui Chen},
     title = {Research on {Discovering} {Deep} {Web} {Entries}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {8},
     number = {3},
     year = {2011},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/}
}
TY  - JOUR
AU  - Ying Wang
AU  - Huilai Li
AU  - Wanli Zuo
AU  - Fengling He
AU  - Xin Wang
AU  - Kerui Chen
TI  - Research on Discovering Deep Web Entries
JO  - Computer Science and Information Systems
PY  - 2011
VL  - 8
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/
ID  - CSIS_2011_8_3_a13
ER  - 
%0 Journal Article
%A Ying Wang
%A Huilai Li
%A Wanli Zuo
%A Fengling He
%A Xin Wang
%A Kerui Chen
%T Research on Discovering Deep Web Entries
%J Computer Science and Information Systems
%D 2011
%V 8
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/
%F CSIS_2011_8_3_a13
Ying Wang; Huilai Li; Wanli Zuo; Fengling He; Xin Wang; Kerui Chen. Research on Discovering Deep Web Entries. Computer Science and Information Systems, Volume 8 (2011) no. 3. http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/