Paper published on a website (Scientific congresses and symposiums)
Informed POMDP: Leveraging Additional Information in Model-Based RL
Lambrechts, Gaspard; Bolland, Adrien; Ernst, Damien
2024Reinforcement Learning Conference
Peer reviewed
 

Files


Full Text
informed-pomdp-rlc.pdf
Author postprint (1.03 MB) Creative Commons License - Attribution
Download
Annexes
informed-pomdp.pdf
(1.34 MB) Creative Commons License - Attribution, ShareAlike
Slides
Download
informed-pomdp-poster.pdf
(349.82 kB) Creative Commons License - Attribution, ShareAlike
Poster
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Computer Science - Learning
Abstract :
[en] In this work, we generalize the problem of learning through interaction in a POMDP by accounting for eventual additional information available at training time. First, we introduce the informed POMDP, a new learning paradigm offering a clear distinction between the information at training and the observation at execution. Next, we propose an objective that leverages this information for learning a sufficient statistic of the history for the optimal control. We then adapt this informed objective to learn a world model able to sample latent trajectories. Finally, we empirically show a learning speed improvement in several environments using this informed world model in the Dreamer algorithm. These results and the simplicity of the proposed adaptation advocate for a systematic consideration of eventual additional information when learning in a POMDP using model-based RL.
Disciplines :
Computer science
Author, co-author :
Lambrechts, Gaspard ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Bolland, Adrien ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Ernst, Damien  ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Language :
English
Title :
Informed POMDP: Leveraging Additional Information in Model-Based RL
Publication date :
August 2024
Event name :
Reinforcement Learning Conference
Event place :
Amherst, United States - Massachusetts
Event date :
August 9th, 2024
Audience :
International
Peer reviewed :
Peer reviewed
Tags :
CÉCI : Consortium des Équipements de Calcul Intensif
Tier-1 supercomputer
Commentary :
10 pages, 22 pages total, 10 figures
Available on ORBi :
since 21 June 2023

Statistics


Number of views
146 (76 by ULiège)
Number of downloads
48 (16 by ULiège)

Bibliography


Similar publications



Contact ORBi