Doctoral thesis (Dissertations and theses)
Importance measures derived from random forests: characterisation and extension
Sutera, Antonio
2019
 

Files


Full Text
phd_thesis_Antonio_Sutera.pdf
Publisher postprint (2.35 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
machine learning; random forest; variable importances
Abstract :
[en] Nowadays new technologies, and especially artificial intelligence, are more and more established in our society. Big data analysis and machine learning, two sub-fields of artificial intelligence, are at the core of many recent breakthroughs in many application fields (e.g., medicine, communication, finance, ...), including some that are strongly related to our day-to-day life (e.g., social networks, computers, smartphones, ...). In machine learning, significant improvements are usually achieved at the price of an increasing computational complexity and thanks to bigger datasets. Currently, cutting-edge models built by the most advanced machine learning algorithms typically became simultaneously very efficient and profitable but also extremely complex. Their complexity is to such an extent that these models are commonly seen as black-boxes providing a prediction or a decision which can not be interpreted or justified. Nevertheless, whether these models are used autonomously or as a simple decision-making support tool, they are already being used in machine learning applications where health and human life are at stake. Therefore, it appears to be an obvious necessity not to blindly believe everything coming out of those models without a detailed understanding of their predictions or decisions. Accordingly, this thesis aims at improving the interpretability of models built by a specific family of machine learning algorithms, the so-called tree-based methods. Several mechanisms have been proposed to interpret these models and we aim along this thesis to improve their understanding, study their properties, and define their limitations.
Disciplines :
Engineering, computing & technology: Multidisciplinary, general & others
Author, co-author :
Sutera, Antonio ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Algorith. des syst. en interaction avec le monde physique
Language :
English
Title :
Importance measures derived from random forests: characterisation and extension
Defense date :
13 June 2019
Number of pages :
193+46
Institution :
ULiège - Université de Liège
Degree :
Docteur en Sciences de l'ingénieur
Promotor :
Geurts, Pierre ;  Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
Wehenkel, Louis  ;  Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
President :
Louppe, Gilles  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Big Data
Jury member :
Meyer, Patrick ;  Université de Liège - ULiège > Integrative Biological Sciences (InBioS)
Frénay, Benoît
Scornet, Erwan
Genuer, Robin
Funders :
Fédération Wallonie Bruxelles. Fonds de la Recherche Scientifique - F.R.S.-FNRS
Available on ORBi :
since 14 June 2019

Statistics


Number of views
264 (66 by ULiège)
Number of downloads
151 (35 by ULiège)

Bibliography


Similar publications



Contact ORBi