Reference : Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories
Diverse speeches and writings : Speech/Talk
Engineering, computing & technology : Computer science
Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories
Fonteneau, Raphaël mailto [Université de Liège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation >]
Grascomp's Day
November 3th, 2011
GRASCOMP Graduate School in Computing Science
[en] Reinforcement Learning
[en] Batch mode reinforcement learning (BMRL) is a field of research which focuses on the inference of high-performance control policies when the only information on the control problem is gathered in a set of trajectories. When the (state, action) spaces are large or continuous, most of the techniques proposed in the literature for solving BMRL problems combine value or policy iteration schemes from the Dynamic Programming (DP) theory with function approximators representing (state-action) value functions. While successful in many studies, the use of function approximators for solving BMRL problems has also drawbacks. In particular, the use of function approximator makes performance guarantees difficult to obtain, and does not systematically take advantage of optimal trajectories. In this talk, I will present a new line of research for solving BMRL problems based on the synthesis of ``artificial trajectories'' which opens avenues for desiging new BMRL algorithms. In particular, it avoids the two above-mentioned drawbacks of the use of function approximator.

File(s) associated to this reference

Fulltext file(s):

Open access
03November2011@GRASCOMP.pdfAuthor postprint2.5 MBView/Open

Bookmark and Share SFX Query

All documents in ORBi are protected by a user license.