Chronic Obstructive Pulmonary Disease; Cluster analysis; Missing value; Factor analysis of mixed data
Abstract :
[en] Context
The approach to classification of chronic obstructive pulmonary disease patients is complicated by the heterogeneous and multidimensional nature of the disease. In the past years, application of classification methods in this context have been developed by limited and selected variables to define phenotypes with comparable results.
Objective
In the present study, we aim to identity distinct phenotypes of adults suffering from chronic obstructive pulmonary disease (COPD). Clustering was applied to understanding, management and better predict future risks and optimize treatment selection based on the new groupings of patients.
Method
In this application, patients were described by multiple and huge sets of variables that structured into groups. Two step approach was used for reduction of the huge variables and identification of the cluster. In the first step, the choice of variables to input into the clustering algorithm was one of the most important considerations. Most of previous studies employed manual variable extraction based on expert advice or dimension reduction techniques such as PCA. However, in this study we used multiple factor analysis (MFA), which is an extension of principal component analysis (PCA), for reducing the complexity of high-dimensional data. After this first step, a hierarchical and partition clustering method around medoids based on Jensen–Shannon distance was applied to the multiple factor analysis. This method is more robust to noise and outliers. All statistical analyses were performed using R software.
Results
The best cluster number was found to be equal to 5. Those clusters were identified as: mild COPD, Eosinophilic COPD, Atopic COPD, COPD + Emphysema and neutrophilic COPD + Emphysema + Systemic inflammation. Use of other criterion to compute distance didn’t present a significant improvement in the results obtained from clinical data.
Conclusions
Application of this new cluster approach allowed to identify five groups within stable COPD patients based on huge clinical variables while accounting for noise and outlier. Derived results can be used to predict outcomes of patients with COPD and to aid in development of personalized therapy.
Disciplines :
Public health, health care sciences & services
Author, co-author :
Nekoee Zahraei, Halehsadat ; Université de Liège - ULiège > Département des sciences de la santé publique > Biostatistique
PAULUS, Virginie ; Centre Hospitalier Universitaire de Liège - CHU > Département de médecine interne > Clinique de l'asthme
HENKET, Monique ; Centre Hospitalier Universitaire de Liège - CHU > Département de médecine interne > Clinique de l'asthme
Louis, Renaud ; Université de Liège - ULiège > Département des sciences cliniques > Pneumologie - Allergologie
Donneau, Anne-Françoise ; Université de Liège - ULiège > Département des sciences de la santé publique > Biostatistique
Language :
English
Title :
Identification of Chronic Obstructive Pulmonary Disease Phenotype using PCA and Clustering Methodologies
Publication date :
17 July 2018
Event name :
40th International Society for Clinical Biostatistics (ISCB40)
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.