Ensemble of randomized trees; Pruning; L1-norm regularization; LASSO; Supervised learning; Machine Learning; Randomization; Model reduction; Decision tree
Abstract :
Random forests are effective supervised learning methods applicable to large-scale datasets. However, the space complexity of tree ensembles, measured by their total number of nodes, is often prohibitive, especially for problems with very high-dimensional input spaces. We propose to study their compressibility by applying an L1-based regularization to the set of indicator functions defined by all their nodes. We show experimentally that it is indeed possible to preserve, or even improve, model accuracy while significantly reducing space complexity.
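The compression scheme described in the abstract can be sketched as follows. This is a minimal reconstruction assuming scikit-learn (not the authors' implementation): each node of each tree in the forest defines a binary indicator feature over the samples, and fitting a Lasso on those features selects a sparse subset of nodes, which amounts to pruning the ensemble. The dataset, `alpha`, and all parameter values below are illustrative assumptions.

```python
# Hedged sketch of L1-based random forest compression: L1 regularization
# over the node indicator functions of the ensemble (illustrative only).
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import Lasso

# Toy regression problem (assumed, for illustration).
X, y = make_regression(n_samples=300, n_features=20, random_state=0)
forest = RandomForestRegressor(n_estimators=10, random_state=0).fit(X, y)

# Each column of Z indicates whether a sample traverses a given node
# of some tree in the ensemble: these are the indicator functions.
Z, _ = forest.decision_path(X)
Z = np.asarray(Z.todense(), dtype=float)

# L1 regularization keeps only a sparse subset of the node indicators;
# nodes with zero weight can be removed from the model.
lasso = Lasso(alpha=1.0, max_iter=10000).fit(Z, y)
kept = np.flatnonzero(lasso.coef_)
print(f"{len(kept)} of {Z.shape[1]} nodes retained")
```

In this sketch, the compressed model is the linear combination of the retained node indicators; the space saving comes from discarding all nodes whose Lasso coefficient is exactly zero.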
Research center :
Systèmes et modélisation (Systems and Modeling)
GIGA-R - Giga-Research - ULiège
Funders :
FRIA - Fonds pour la Formation à la Recherche dans l'Industrie et dans l'Agriculture [BE]
BELSPO - Belgian Science Policy Office [BE]
EU - European Union [BE]
Funding text :
F. Schnitzler is supported by a F.R.I.A. scholarship. This work was funded by the Biomagnet IUAP network of the Belgian Science Policy Office and the Pascal2 network of excellence of the EC.
Commentary :
An extended abstract presenting the article "Joly, A., Schnitzler, F., Geurts, P., & Wehenkel, L. (2012). L1-based compression of random forest models. 20th European Symposium on Artificial Neural Networks.", which also led to an oral presentation.