Research on Discovering Deep Web Entries
Computer Science and Information Systems, Tome 8 (2011) no. 3
Cet article a éte moissonné depuis la source Computer Science and Information Systems website
Ontology plays an important role in locating Domain-Specific Deep Web contents, therefore, this paper presents a novel framework WFF for efficiently locating Domain-Specific Deep Web databases based on focused crawling and ontology by constructing Web Page Classifier(WPC), Form Structure Classifier(FSC) and Form Content Classifier(FCC) in a hierarchical fashion. Firstly, WPC discovers potentially interesting pages based on ontology-assisted focused crawler. Then, FSC analyzes the interesting pages and determines whether these pages subsume searchable forms based on structural characteristics. Lastly, FCC identifies searchable forms that belong to a given domain in the semantic level, and stores these URLs of Domain-Specific searchable forms to a database. Through a detailed experimental evaluation, WFF framework not only simplifies discovering process, but also effectively determines Domain-Specific databases.
Keywords:
Deep Web, ontology, WPC, FSC, FCC
@article{CSIS_2011_8_3_a13,
author = {Ying Wang and Huilai Li and Wanli Zuo and Fengling He and Xin Wang and Kerui Chen},
title = {Research on {Discovering} {Deep} {Web} {Entries}},
journal = {Computer Science and Information Systems},
year = {2011},
volume = {8},
number = {3},
url = {http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/}
}
TY - JOUR AU - Ying Wang AU - Huilai Li AU - Wanli Zuo AU - Fengling He AU - Xin Wang AU - Kerui Chen TI - Research on Discovering Deep Web Entries JO - Computer Science and Information Systems PY - 2011 VL - 8 IS - 3 UR - http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/ ID - CSIS_2011_8_3_a13 ER -
Ying Wang; Huilai Li; Wanli Zuo; Fengling He; Xin Wang; Kerui Chen. Research on Discovering Deep Web Entries. Computer Science and Information Systems, Tome 8 (2011) no. 3. http://geodesic.mathdoc.fr/item/CSIS_2011_8_3_a13/