Article (Scientific journals)
Distributed learning: Developing a predictive model based on data from multiple hospitals without data leaving the hospital – A real life proof of concept
JOCHEMS, Arthur; DEIST, Timo M.; VAN SOEST, Johan et al.
2016, in Radiotherapy and Oncology
Peer Reviewed verified by ORBi
 

Details



Keywords :
Bayesian networks; Distributed learning; Privacy preserving data-mining; Dyspnea; Machine learning
Abstract :
[en] Purpose: One of the major hurdles in enabling personalized medicine is obtaining sufficient patient data to feed into predictive models. Combining data originating from multiple hospitals is difficult because of ethical, legal, political, and administrative barriers associated with data sharing. To avoid these issues, a distributed learning approach can be used. Distributed learning is defined as learning from data without the data leaving the hospital.

Patients and methods: Clinical data from 287 lung cancer patients, treated with curative intent with chemoradiation (CRT) or radiotherapy (RT) alone, were collected from and stored in 5 different medical institutes (123 patients at MAASTRO (Netherlands, Dutch), 24 at Jessa (Belgium, Dutch), 34 at Liege (Belgium, Dutch and French), 48 at Aachen (Germany, German) and 58 at Eindhoven (Netherlands, Dutch)). A Bayesian network model is adapted for distributed learning (watch the animation: http://youtu.be/nQpqMIuHyOk). The model predicts dyspnea, which is a common side effect after radiotherapy treatment of lung cancer.

Results: We show that it is possible to use the distributed learning approach to train a Bayesian network model on patient data originating from multiple hospitals without these data leaving the individual hospital. The AUC of the model is 0.61 (95% CI, 0.51–0.70) on a 5-fold cross-validation and ranges from 0.59 to 0.71 on external validation sets.

Conclusion: Distributed learning can allow the learning of predictive models on data originating from multiple hospitals while avoiding many of the data sharing barriers. Furthermore, the distributed learning approach can be used to extract and employ knowledge from routine patient data from multiple hospitals while complying with the various national and European privacy laws.
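Illustration (not part of the abstract): the abstract does not specify how the Bayesian network is fitted across sites, so the following is only a minimal sketch of one common way such a scheme can work, under the assumption that each hospital shares aggregate counts rather than patient records. The function names, the variables `crt` and `dyspnea`, and the toy records are all hypothetical and are not taken from the article.

```python
from collections import Counter

def local_counts(records, child, parents):
    """Run inside one hospital: return aggregate counts only, never patient rows.
    (Hypothetical helper; assumes discrete variables stored in dicts.)"""
    counts = Counter()
    for r in records:
        parent_config = tuple(r[p] for p in parents)
        counts[(parent_config, r[child])] += 1
    return counts

def merge_counts(site_counts):
    """Run at the coordinating node: sum the per-site count tables."""
    total = Counter()
    for c in site_counts:
        total.update(c)  # Counter.update adds counts
    return total

def conditional_probabilities(total, child_states, alpha=1.0):
    """Laplace-smoothed P(child | parents) from the pooled counts.
    alpha is an assumed smoothing constant."""
    parent_configs = {pc for (pc, _) in total}
    cpt = {}
    for pc in parent_configs:
        denom = sum(total[(pc, s)] for s in child_states) + alpha * len(child_states)
        cpt[pc] = {s: (total[(pc, s)] + alpha) / denom for s in child_states}
    return cpt

# Toy, made-up records for two sites (1 = yes, 0 = no).
site_a = [{"crt": 1, "dyspnea": 1}, {"crt": 1, "dyspnea": 0}]
site_b = [{"crt": 0, "dyspnea": 0}, {"crt": 1, "dyspnea": 1}]

pooled = merge_counts([
    local_counts(site_a, "dyspnea", ["crt"]),
    local_counts(site_b, "dyspnea", ["crt"]),
])
cpt = conditional_probabilities(pooled, child_states=[0, 1])
```

In this sketch the pooled counts are identical to the counts a single centralized table would give, which is why the coordinating node can estimate the conditional probabilities without seeing any individual record. Whether count sharing alone satisfies a given privacy regulation is a separate question that the sketch does not address.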
Disciplines :
Oncology
Author, co-author :
JOCHEMS, Arthur;  Department of Radiation Oncology (MAASTRO Clinic)
DEIST, Timo M.;  Department of Radiation Oncology (MAASTRO Clinic)
VAN SOEST, Johan;  Department of Radiation Oncology (MAASTRO Clinic)
ELBE, Michael;  University clinic Aachen
BULENS, Paul;  Jessa Hospital > Department of Radiation Oncology
COUCKE, Philippe;  Centre Hospitalier Universitaire de Liège - CHU > Service médical de radiothérapie
DRIES, Wim;  Catharina-Hospital Eindhoven
LAMBIN, Philippe;  Universiteit Maastricht > Oncology and developmental biology
DEKKER, André;  MAASTRO CLINIC > Radiation Oncology
Language :
English
Title :
Distributed learning: Developing a predictive model based on data from multiple hospitals without data leaving the hospital – A real life proof of concept
Alternative titles :
[fr] Apprentissage distribué: Élaboration d'un modèle prédictif basé sur les données de plusieurs hôpitaux sans données sortant de l'hôpital - Une preuve de concept réelle
Publication date :
03 October 2016
Journal title :
Radiotherapy and Oncology
ISSN :
0167-8140
eISSN :
1879-0887
Publisher :
Elsevier Scientific, Limerick, Ireland
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBi :
since 26 January 2017

Statistics


Number of views
95 (7 by ULiège)
Number of downloads
169 (5 by ULiège)

Scopus citations®
135
Scopus citations® without self-citations
113
OpenCitations
123
