Classification efficiency of the trimmed k-means procedure

[en] The k-means method is used in classification to group similar observations in k groups. When a second sample is available to test the obtained groupings, the rate of misclassification can be computed. If the samples are generated from a mixture of two homoscedastic and spherically symmetric distributions, the rate of misclassification equals that of the Bayes rule. Therefore, the k-means method is optimal under such a mixture model. However, it is not robust with respect to outliers in the dataset used to construct the groups. To avoid this problem, the k-means procedure has been adapted in many ways. This presentation focuses on the trimmed k-means method defined by trimming some of the observations. The advantage of this method, besides its resistance to outliers, is that optimality is preserved. However, it is well known that trimming observations leads to a loss in classification efficiency. The latter can be measured by means of the influence function of the misclassifiation rate.

Disciplines :

Mathematics

Author, co-author :

Ruwet, Christel ; Université de Liège - ULiège > Département de mathématique > Statistique mathématique

Language :

English

Title :

Classification efficiency of the trimmed k-means procedure

Alternative titles :

[fr] Efficacité de classification de la méthode des k-moyennes tronquées

Publication date :

21 May 2012

Event name :

44e Journées de Statistique

Event organizer :

Société Française de Statistique

Event place :

Bruxelles, Belgium

Event date :

21-25 mai 2012

Audience :

International

Available on ORBi :

since 12 August 2013

Statistics

Number of views

65 (7 by ULiège)

Number of downloads

482 (5 by ULiège)

More statistics

Bibliography

Similar publications

Sorry the service is unavailable at the moment. Please try again later.

Name

Provider / Domaine

Expiration

Description

JSESSIONID

Oracle Corporation

www.uliege.be

Session

General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.

CookieScriptConsent

CookieScript

.uliege.be

1 year

This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.

Name

Provider / Domaine

Expiration

Description

_pk_id

InnoCraft Ltd

.uliege.be

1 year

Used to store a few details about the user such as the unique visitor ID

_pk_ses

InnoCraft Ltd

.uliege.be

30 minutes

Short lived cookies used to temporarily store data for the visit

_pk_ref

InnoCraft Ltd

.uliege.be

6 months

Used to store the attribution information, the referrer initially used to visit the website

Name	Provider / Domaine	Expiration	Description
JSESSIONID	Oracle Corporation www.uliege.be	Session	General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.
CookieScriptConsent	CookieScript .uliege.be	1 year	This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.

Name	Provider / Domaine	Expiration	Description
_pk_id	InnoCraft Ltd .uliege.be	1 year	Used to store a few details about the user such as the unique visitor ID
_pk_ses	InnoCraft Ltd .uliege.be	30 minutes	Short lived cookies used to temporarily store data for the visit
_pk_ref	InnoCraft Ltd .uliege.be	6 months	Used to store the attribution information, the referrer initially used to visit the website