Speech/Talk (Diverse speeches and writings)
Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories
Fonteneau, Raphaël
2011
 

Files


Full Text
03November2011@GRASCOMP.pdf
Author postprint (2.56 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Reinforcement Learning
Abstract :
[en] Batch mode reinforcement learning (BMRL) is a field of research which focuses on the inference of high-performance control policies when the only information on the control problem is gathered in a set of trajectories. When the (state, action) spaces are large or continuous, most of the techniques proposed in the literature for solving BMRL problems combine value or policy iteration schemes from the Dynamic Programming (DP) theory with function approximators representing (state-action) value functions. While successful in many studies, the use of function approximators for solving BMRL problems has also drawbacks. In particular, the use of function approximator makes performance guarantees difficult to obtain, and does not systematically take advantage of optimal trajectories. In this talk, I will present a new line of research for solving BMRL problems based on the synthesis of ``artificial trajectories'' which opens avenues for desiging new BMRL algorithms. In particular, it avoids the two above-mentioned drawbacks of the use of function approximator.
Disciplines :
Computer science
Author, co-author :
Fonteneau, Raphaël ;  Université de Liège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Language :
English
Title :
Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories
Publication date :
03 November 2011
Event name :
Grascomp's Day
Event organizer :
GRASCOMP Graduate School in Computing Science
Event place :
Bruxelles, Belgium
Event date :
November 3th, 2011
Available on ORBi :
since 13 May 2015

Statistics


Number of views
31 (3 by ULiège)
Number of downloads
46 (2 by ULiège)

Bibliography


Similar publications



Contact ORBi