Article (Scientific journals)
RelaxMCD: smooth optimisation for the Minimum Covariance Determinant estimator
Schyns, Michael; Haesbroeck, Gentiane; Critchley, Frank
2010In Computational Statistics and Data Analysis, 54 (4), p. 843-857
Peer Reviewed verified by ORBi
 

Files


Full Text
relaxMCD_final.pdf
Author postprint (397.87 kB)
Request a copy

The original publication is available at www.sciencedirect.com (csda)


All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
MCD estimator; resampling algorithms; k-means; robustness
Abstract :
[en] The Minimum Covariance Determinant (MCD) estimator is a highly robust procedure for estimating the center and shape of a high dimensional data set. It consists of determining a subsample of h points out of n which minimizes the generalized variance. By definition, the computation of this estimator gives rise to a combinatorial optimization problem, for which several approximative algorithms have been developed. Some of these approximations are quite powerful, but they do not take advantage of any smoothness in the objective function. In this paper, focus is on the approach outlined in a general framework in Critchley et al. (2009) and which transforms any discrete and high dimensional combinatorial problem of this type into a continuous and low-dimensional one. The idea is to build on the general algorithm proposed by Critchley et al. (2009) in order to take into account the particular features of the MCD methodology. More specifically, both the adaptation of the algorithm to the specific MCD target function as well as the comparison of this “specialized” algorithm with the usual competitors for computing MCD are the main goals of this paper. The adaptation focuses on the design of “clever” starting points in order to systematically investigate the search domain. Accordingly, a new and surprisingly efficient procedure based on the well known k-means algorithm is constructed. The adapted algorithm, called RelaxMCD, is then compared by means of simulations and examples with FASTMCD and the Feasible Subset Algorithm, both benchmark algorithms for computing MCD. As a by-product, it is shown that RelaxMCD is a general technique encompassing the two others, yielding insight about their overall good performance.
Research center :
QuantOM
Disciplines :
Mathematics
Author, co-author :
Schyns, Michael ;  Université de Liège - ULiège > HEC - École de gestion de l'ULiège > Informatique de gestion
Haesbroeck, Gentiane ;  Université de Liège - ULiège > Département de mathématique > Statistique (aspects théoriques)
Critchley, Frank;  The Open University > Department of Mathematics and Statistics
Language :
English
Title :
RelaxMCD: smooth optimisation for the Minimum Covariance Determinant estimator
Publication date :
April 2010
Journal title :
Computational Statistics and Data Analysis
ISSN :
0167-9473
eISSN :
1872-7352
Publisher :
Elsevier Science, Amsterdam, Netherlands
Volume :
54
Issue :
4
Pages :
843-857
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBi :
since 04 May 2009

Statistics


Number of views
252 (44 by ULiège)
Number of downloads
8 (7 by ULiège)

Scopus citations®
 
21
Scopus citations®
without self-citations
21
OpenCitations
 
20

Bibliography


Similar publications



Contact ORBi