[en] This work proposes an approach based on reward shaping
techniques in a reinforcement learning setting to approximate the optimal
decision-making process (also called the optimal policy) in a desired
task with a limited amount of data. We extract prior information from
an existing family of policies, which is used as a heuristic to guide the
construction of the new policy under this challenging condition. We use this
approach to study the relationship between the similarity of two tasks
and the minimal amount of data needed to compute a near-optimal policy
for the second one using the prior information of the existing policy.
Preliminary results show that even for the existing task least similar
to the desired one, only 10% of the dataset was needed
to compute the corresponding near-optimal policy.
Disciplines :
Computer science
Author, co-author :
Aittahar, Samy ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Smart grids
Sootla, Aivar
Other collaborator :
Ernst, Damien ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Smart grids
Language :
English
Title :
Policy transfer using Value Function as Prior Information