Computing bounds for kernel-based policy evaluation in reinforcement learning

Fonteneau, Raphaël; Murphy, Susan A.; Wehenkel, Louis; Ernst, Damien

Download

External report (Reports)

Computing bounds for kernel-based policy evaluation in reinforcement learning

Fonteneau, Raphaël; Murphy, Susan A.; Wehenkel, Louis et al.

2010

Permalink
https://hdl.handle.net/2268/103545

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

technical_report.pdf

Author postprint (243.17 kB)

Download

All documents in ORBi are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Batch mode reinforcement learning

Abstract :

[en] This technical report proposes an approach for computing bounds on the finite-time return of a policy using kernel-based approximators from a sample of trajectories in a continuous state space and deterministic framework.

Disciplines :

Computer science

Author, co-author :

Fonteneau, Raphaël ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Murphy, Susan A.

Wehenkel, Louis ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Ernst, Damien ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Smart grids

Language :

English

Title :

Computing bounds for kernel-based policy evaluation in reinforcement learning

Publication date :

2010

Publisher :

University of Liège

Funders :

F.R.S.-FNRS - Fonds de la Recherche Scientifique

Available on ORBi :

since 19 November 2011

Statistics

Number of views

74 (4 by ULiège)

Number of downloads

111 (2 by ULiège)

More statistics