Paper published in a book (Scientific congresses and symposiums)
L1-based compression of random forest models
Joly, Arnaud; Schnitzler, François; Geurts, Pierre et al.
2012In Proceeding of the 21st Belgian-Dutch Conference on Machine Learning
 

Files


Full Text
benelearn2012abstract.pdf
Author postprint (110.14 kB)
Download
Annexes
slide.pdf
(810.64 kB)
Visual support for the oral presentation
Download
speech.odt
(18.71 kB)
Speech of the oral presentation
Request a copy

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Ensemble of randomized trees; Pruning; L1-norm regularization; LASSO; Supervised learning; Machine Learning; Randomization; Model reduction; Decision tree
Abstract :
[en] Random forests are effective supervised learning methods applicable to large-scale datasets. However, the space complexity of tree ensembles, in terms of their total number of nodes, is often prohibitive, specially in the context of problems with very high-dimensional input spaces. We propose to study their compressibility by applying a L1-based regularization to the set of indicator functions defined by all their nodes. We show experimentally that preserving or even improving the model accuracy while significantly reducing its space complexity is indeed possible.
Research center :
Systèmes et modélisation
GIGA‐R - Giga‐Research - ULiège
Disciplines :
Electrical & electronics engineering
Computer science
Author, co-author :
Joly, Arnaud ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Schnitzler, François ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Geurts, Pierre ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Wehenkel, Louis  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Language :
English
Title :
L1-based compression of random forest models
Publication date :
24 May 2012
Event name :
Belgian-Dutch Conference on Machine Learning & Predictive Modeling for the Life Sciences 2012
Event organizer :
Faculty of Bioscience Engineering of Ghent University
Event place :
Ghent, Belgium
Event date :
from 24 may 2012 to 25 may 2012
Main work title :
Proceeding of the 21st Belgian-Dutch Conference on Machine Learning
ISBN/EAN :
978-94-6197-044-2
Funders :
FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]
BELSPO - Belgian Science Policy Office [BE]
EU - European Union [BE]
Funding text :
F. Schnitzler is supported by a F.R.I.A. scholarship. This work was funded by the Biomagnet IUAP network of the Belgian Science Policy Office and the Pascal2 network of excellence of the EC.
Commentary :
An extended abstract presenting the article "Joly, A., Schnitzler, F., Geurts, P., & Wehenkel, L. (2012). L1-based compression of random forest models. 20th European Symposium on Artificial Neural Networks." which leads also to an oral presentation.
Available on ORBi :
since 10 June 2012

Statistics


Number of views
337 (59 by ULiège)
Number of downloads
263 (17 by ULiège)

Bibliography


Similar publications



Contact ORBi