Paper published on a website (Scientific congresses and symposiums)
Informed POMDP: Leveraging Additional Information in Model-Based RL
Lambrechts, Gaspard; Bolland, Adrien; Ernst, Damien
2023ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems
Peer reviewed
 

Files


Full Text
informed-pomdp.pdf
Author postprint (925.59 kB) Creative Commons License - Attribution
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Computer Science - Learning
Abstract :
[en] In this work, we generalize the problem of learning through interaction in a POMDP by accounting for eventual additional information available at training time. First, we introduce the informed POMDP, a new learning paradigm offering a clear distinction between the training information and the execution observation. Next, we propose an objective for learning a sufficient statistic from the history for the optimal control that leverages this information. We then show that this informed objective consists of learning an environment model from which we can sample latent trajectories. Finally, we show for the Dreamer algorithm that the convergence speed of the policies is sometimes greatly improved on several environments by using this informed environment model. Those results and the simplicity of the proposed adaptation advocate for a systematic consideration of eventual additional information when learning in a POMDP using model-based RL.
Disciplines :
Computer science
Author, co-author :
Lambrechts, Gaspard ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Bolland, Adrien ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Ernst, Damien  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Language :
English
Title :
Informed POMDP: Leveraging Additional Information in Model-Based RL
Publication date :
June 2023
Event name :
ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems
Event place :
Honolulu, United States - Hawaii
Event date :
July 28th, 2023
Audience :
International
Peer reviewed :
Peer reviewed
Tags :
CÉCI : Consortium des Équipements de Calcul Intensif
Tier-1 supercomputer
Commentary :
8 pages, 13 pages total, 8 figures
Available on ORBi :
since 21 June 2023

Statistics


Number of views
78 (46 by ULiège)
Number of downloads
25 (7 by ULiège)

Bibliography


Similar publications



Contact ORBi