On Approximate k-Nearest Neighbor Searches Based on the Earth Mover’s Distance for Efficient Content-Based Multimedia Information Retrieval
Computer Science and Information Systems, Tome 16 (2019) no. 2.

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

The Earth Mover's Distance (EMD) is one of the most-widely used distance functions to measure the similarity between two multimedia objects. While providing good search results, the EMD is too much time consuming to be used in large multimedia databases. To solve the problem, we propose an approximate k-nearest neighbor (k-NN) search method based on the EMD. In the proposed method, the overhead for both disk accesses and EMD computations is reduced significantly, thanks to the approximation. First, the proposed method builds an index using the M-tree, a distance-based multi-dimensional index structure, to reduce the disk access overhead. When building the index, we reduce the number of features in the multimedia objects through dimensionality-reduction. When performing the k-NN search on the M-tree, we find a small set of candidates from the disk using the index and then perform the post-processing on them. Second, the proposed method uses the approximate EMD for index retrieval and post-processing to reduce the computational overhead of the EMD. To compensate the errors due to the approximation, the method provides a way of accuracy improvement of the approximate EMD. We performed extensive experiments to show the efficiency of the proposed method. As a result, the method achieves significant improvement in performance with only small errors: the proposed method outperforms the previous method by up to 67.3% with only 3.5% error.
Keywords: Earth mover's distance, content-based information retrieval, k-nearest neighbor query
@article{CSIS_2019_16_2_a13,
     author = {Min-Hee Jang and Sang-Wook Kim and Woong-Kee Loh and Jung-Im Won},
     title = {On {Approximate} {k-Nearest} {Neighbor} {Searches} {Based} on the {Earth} {Mover{\textquoteright}s} {Distance} for {Efficient} {Content-Based} {Multimedia} {Information} {Retrieval}},
     journal = {Computer Science and Information Systems},
     publisher = {mathdoc},
     volume = {16},
     number = {2},
     year = {2019},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a13/}
}
TY  - JOUR
AU  - Min-Hee Jang
AU  - Sang-Wook Kim
AU  - Woong-Kee Loh
AU  - Jung-Im Won
TI  - On Approximate k-Nearest Neighbor Searches Based on the Earth Mover’s Distance for Efficient Content-Based Multimedia Information Retrieval
JO  - Computer Science and Information Systems
PY  - 2019
VL  - 16
IS  - 2
PB  - mathdoc
UR  - http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a13/
ID  - CSIS_2019_16_2_a13
ER  - 
%0 Journal Article
%A Min-Hee Jang
%A Sang-Wook Kim
%A Woong-Kee Loh
%A Jung-Im Won
%T On Approximate k-Nearest Neighbor Searches Based on the Earth Mover’s Distance for Efficient Content-Based Multimedia Information Retrieval
%J Computer Science and Information Systems
%D 2019
%V 16
%N 2
%I mathdoc
%U http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a13/
%F CSIS_2019_16_2_a13
Min-Hee Jang; Sang-Wook Kim; Woong-Kee Loh; Jung-Im Won. On Approximate k-Nearest Neighbor Searches Based on the Earth Mover’s Distance for Efficient Content-Based Multimedia Information Retrieval. Computer Science and Information Systems, Tome 16 (2019) no. 2. http://geodesic.mathdoc.fr/item/CSIS_2019_16_2_a13/