Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories

Fonteneau, Raphaël

Download

Speech/Talk (Diverse speeches and writings)

Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories

Fonteneau, Raphaël

2011

Permalink
https://hdl.handle.net/2268/181470

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

03November2011@GRASCOMP.pdf

Author postprint (2.56 MB)

Download

All documents in ORBi are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Keywords :

Reinforcement Learning

Abstract :

[en] Batch mode reinforcement learning (BMRL) is a field of research which focuses on the inference of high-performance control policies when the only information on the control problem is gathered in a set of trajectories. When the (state, action) spaces are large or continuous, most of the techniques proposed in the literature for solving BMRL problems combine value or policy iteration schemes from the Dynamic Programming (DP) theory with function approximators representing (state-action) value functions. While successful in many studies, the use of function approximators for solving BMRL problems has also drawbacks. In particular, the use of function approximator makes performance guarantees difficult to obtain, and does not systematically take advantage of optimal trajectories. In this talk, I will present a new line of research for solving BMRL problems based on the synthesis of ``artificial trajectories'' which opens avenues for desiging new BMRL algorithms. In particular, it avoids the two above-mentioned drawbacks of the use of function approximator.

Disciplines :

Computer science

Author, co-author :

Fonteneau, Raphaël ; Université de Liège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation

Language :

English

Title :

Recent Advances in Batch Mode Reinforcement Learning: Synthesizing Artificial Trajectories

Publication date :

03 November 2011

Event name :

Grascomp's Day

Event organizer :

GRASCOMP Graduate School in Computing Science

Event place :

Bruxelles, Belgium

Event date :

November 3th, 2011

Available on ORBi :

since 13 May 2015

Statistics

Number of views

39 (3 by ULiège)

Number of downloads

56 (2 by ULiège)

More statistics