Detection of influential points by convex hull volume minimization
Kybernetika, Tome 34 (1998) no. 5, pp. 515-534
Cet article a éte moissonné depuis la source Czech Digital Mathematics Library

Voir la notice de l'article

A method of geometrical characterization of multidimensional data sets, including construction of the convex hull of the data and calculation of the volume of the convex hull, is described. This technique, together with the concept of minimum convex hull volume, can be used for detection of influential points or outliers in multiple linear regression. An approximation to the true concept is achieved by ordering the data into a linear sequence such that the volume of the convex hull of the first $n$ terms in the sequence grows as slowly as possible with $n$. The performance of the method is demonstrated on four well known data sets. The average computational complexity needed for the ordering is estimated by $O(N^{2+(p-1)/(p+1)})$ for large $N$, where $N$ is the number of observations and $p$ is the data dimension, i. e. the number of predictors plus 1.
A method of geometrical characterization of multidimensional data sets, including construction of the convex hull of the data and calculation of the volume of the convex hull, is described. This technique, together with the concept of minimum convex hull volume, can be used for detection of influential points or outliers in multiple linear regression. An approximation to the true concept is achieved by ordering the data into a linear sequence such that the volume of the convex hull of the first $n$ terms in the sequence grows as slowly as possible with $n$. The performance of the method is demonstrated on four well known data sets. The average computational complexity needed for the ordering is estimated by $O(N^{2+(p-1)/(p+1)})$ for large $N$, where $N$ is the number of observations and $p$ is the data dimension, i. e. the number of predictors plus 1.
Classification : 62H10, 62J05, 90C59, 94A13
Keywords: multiple linear regression; detection of influential points
@article{KYB_1998_34_5_a1,
     author = {Tichavsk\'y, Petr and Bo\v{c}ek, Pavel},
     title = {Detection of influential points by convex hull volume minimization},
     journal = {Kybernetika},
     pages = {515--534},
     year = {1998},
     volume = {34},
     number = {5},
     mrnumber = {1663724},
     zbl = {1274.94019},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/KYB_1998_34_5_a1/}
}
TY  - JOUR
AU  - Tichavský, Petr
AU  - Boček, Pavel
TI  - Detection of influential points by convex hull volume minimization
JO  - Kybernetika
PY  - 1998
SP  - 515
EP  - 534
VL  - 34
IS  - 5
UR  - http://geodesic.mathdoc.fr/item/KYB_1998_34_5_a1/
LA  - en
ID  - KYB_1998_34_5_a1
ER  - 
%0 Journal Article
%A Tichavský, Petr
%A Boček, Pavel
%T Detection of influential points by convex hull volume minimization
%J Kybernetika
%D 1998
%P 515-534
%V 34
%N 5
%U http://geodesic.mathdoc.fr/item/KYB_1998_34_5_a1/
%G en
%F KYB_1998_34_5_a1
Tichavský, Petr; Boček, Pavel. Detection of influential points by convex hull volume minimization. Kybernetika, Tome 34 (1998) no. 5, pp. 515-534. http://geodesic.mathdoc.fr/item/KYB_1998_34_5_a1/

[2] Barnett V.: The ordering of multivariate dat.

[4] Boček P., Lachout P.: Linear programming approach to LMS–estimation. Comput | Zbl

[6] Buchta C., Müler J.: Random polytopes in a bal.

[7] Chand D. R., Kapur S. S.: An algorithm for convex polytope.

[11] Efron B.: The convex hull of a random set of point.

[12] Hadi A. S.: Identifying multiple outliers in multivariate dat.

[19] Rousseeuw P. J.: Least median of squares regressio.

[22] Rousseeuw P. J., Zomeren B. C. van: Unmasking multivariate outliers and leverage points (with comments). J. Amer. Statist. Assoc