Poster (Scientific congresses and symposiums)
Identification of Chronic Obstructive Pulmonary Disease Phenotype using PCA and Clustering Methodologies
Nekoee Zahraei, Halehsadat; PAULUS, Virginie; HENKET, Monique et al.
201840th International Society for Clinical Biostatistics (ISCB40)
 

Files


Full Text
Poster1.pdf
Publisher postprint (747.93 kB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Chronic Obstructive Pulmonary Disease; Cluster analysis; Missing value; Factor analysis of mixed data
Abstract :
[en] Context The approach to classification of chronic obstructive pulmonary disease patients is complicated by the heterogeneous and multidimensional nature of the disease. In the past years, application of classification methods in this context have been developed by limited and selected variables to define phenotypes with comparable results. Objective In the present study, we aim to identity distinct phenotypes of adults suffering from chronic obstructive pulmonary disease (COPD). Clustering was applied to understanding, management and better predict future risks and optimize treatment selection based on the new groupings of patients. Method In this application, patients were described by multiple and huge sets of variables that structured into groups. Two step approach was used for reduction of the huge variables and identification of the cluster. In the first step, the choice of variables to input into the clustering algorithm was one of the most important considerations. Most of previous studies employed manual variable extraction based on expert advice or dimension reduction techniques such as PCA. However, in this study we used multiple factor analysis (MFA), which is an extension of principal component analysis (PCA), for reducing the complexity of high-dimensional data. After this first step, a hierarchical and partition clustering method around medoids based on Jensen–Shannon distance was applied to the multiple factor analysis. This method is more robust to noise and outliers. All statistical analyses were performed using R software. Results The best cluster number was found to be equal to 5. Those clusters were identified as: mild COPD, Eosinophilic COPD, Atopic COPD, COPD + Emphysema and neutrophilic COPD + Emphysema + Systemic inflammation. Use of other criterion to compute distance didn’t present a significant improvement in the results obtained from clinical data. Conclusions Application of this new cluster approach allowed to identify five groups within stable COPD patients based on huge clinical variables while accounting for noise and outlier. Derived results can be used to predict outcomes of patients with COPD and to aid in development of personalized therapy.
Disciplines :
Public health, health care sciences & services
Author, co-author :
Nekoee Zahraei, Halehsadat ;  Université de Liège - ULiège > Département des sciences de la santé publique > Biostatistique
PAULUS, Virginie ;  Centre Hospitalier Universitaire de Liège - CHU > Département de médecine interne > Clinique de l'asthme
HENKET, Monique ;  Centre Hospitalier Universitaire de Liège - CHU > Département de médecine interne > Clinique de l'asthme
Louis, Renaud ;  Université de Liège - ULiège > Département des sciences cliniques > Pneumologie - Allergologie
Donneau, Anne-Françoise ;  Université de Liège - ULiège > Département des sciences de la santé publique > Biostatistique
Language :
English
Title :
Identification of Chronic Obstructive Pulmonary Disease Phenotype using PCA and Clustering Methodologies
Publication date :
17 July 2018
Event name :
40th International Society for Clinical Biostatistics (ISCB40)
Event place :
Leuven, Belgium
Event date :
from 14-07-2019 to 18-07-2019
Audience :
International
Available on ORBi :
since 11 October 2019

Statistics


Number of views
55 (13 by ULiège)
Number of downloads
34 (7 by ULiège)

Bibliography


Similar publications



Contact ORBi