The Efficient Implementation of Distributed Indexing with Hadoop for Digital Investigations on Big Data
Computer Science and Information Systems, Tome 11 (2014) no. 3.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

Big Data brings new challenges to the field of e-Discovery or digital forensics and these challenges are mostly connected to the various methods for data processing. Considering that the most important factors are time and cost in determining success or failure of digital investigation, the development of a valid indexing method for efficient search should come first to more quickly and accurately find relevant evidence from Big Data. This paper, therefore, introduces a Distributed Text Processing System based on Hadoop called DTPS and explains about the distinctions between DTPS and other related researches to emphasize the necessity of it. In addition, this paper describes various experimental results in order to find the best implementation strategy in using Hadoop MapReduce for the distributed indexing and to analyze the worth for practical use of DTPS by comparative evaluation of its performance with similar tools. To be short, the ultimate purpose of this research is the development of useful search engine specially aimed at Big Data indexing as a major part for the future e-Discovery cloud service.
Keywords: Electronic Discovery, e-Discovery, Digital Forensics, Evidence Search, Indexing Performance, Hadoop MapReduce, Distributed Indexing
@article{CSIS_2014_11_3_a8,
     author = {Taerim Lee and Hyejoo Lee and Kyung-Hyune Rhee and Sang Uk Shin},
     title = {The {Efficient} {Implementation} of {Distributed} {Indexing} with {Hadoop} for {Digital} {Investigations} on {Big} {Data}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {11},
     number = {3},
     year = {2014},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2014_11_3_a8/}
}
TY  - JOUR
AU  - Taerim Lee
AU  - Hyejoo Lee
AU  - Kyung-Hyune Rhee
AU  - Sang Uk Shin
TI  - The Efficient Implementation of Distributed Indexing with Hadoop for Digital Investigations on Big Data
JO  - Computer Science and Information Systems
PY  - 2014
VL  - 11
IS  - 3
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2014_11_3_a8/
ID  - CSIS_2014_11_3_a8
ER  - 
%0 Journal Article
%A Taerim Lee
%A Hyejoo Lee
%A Kyung-Hyune Rhee
%A Sang Uk Shin
%T The Efficient Implementation of Distributed Indexing with Hadoop for Digital Investigations on Big Data
%J Computer Science and Information Systems
%D 2014
%V 11
%N 3
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2014_11_3_a8/
%F CSIS_2014_11_3_a8
Taerim Lee; Hyejoo Lee; Kyung-Hyune Rhee; Sang Uk Shin. The Efficient Implementation of Distributed Indexing with Hadoop for Digital Investigations on Big Data. Computer Science and Information Systems, Tome 11 (2014) no. 3. http://geodesic.mathdoc.fr/item/CSIS_2014_11_3_a8/