Paper published on a website (Scientific congresses and symposiums)
Informed POMDP: Leveraging Additional Information in Model-Based RL
Lambrechts, Gaspard; Bolland, Adrien; Ernst, Damien
2024, Reinforcement Learning Conference
Peer reviewed
 

Files


Full Text
informed-pomdp-rlc.pdf
Author postprint (1.03 MB) Creative Commons License - Attribution
Annexes
informed-pomdp-poster.pdf
(349.82 kB) Creative Commons License - Attribution, ShareAlike
Poster
informed-pomdp.pdf
(363.52 kB) Creative Commons License - Attribution
Slides

All documents in ORBi are protected by a user license.


Details



Keywords :
Computer Science - Learning
Abstract :
In this work, we generalize the problem of learning through interaction in a POMDP by accounting for possible additional information available at training time. First, we introduce the informed POMDP, a new learning paradigm offering a clear distinction between the information at training and the observation at execution. Next, we propose an objective that leverages this information for learning a sufficient statistic of the history for the optimal control. We then adapt this informed objective to learn a world model able to sample latent trajectories. Finally, we empirically show a learning speed improvement in several environments using this informed world model in the Dreamer algorithm. These results and the simplicity of the proposed adaptation advocate for a systematic consideration of possible additional information when learning in a POMDP using model-based RL.
Disciplines :
Computer science
Author, co-author :
Lambrechts, Gaspard ; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Bolland, Adrien ; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Ernst, Damien ; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Language :
English
Title :
Informed POMDP: Leveraging Additional Information in Model-Based RL
Publication date :
August 2024
Event name :
Reinforcement Learning Conference
Event place :
Amherst, United States - Massachusetts
Event date :
August 9th, 2024
Audience :
International
Peer reviewed :
Peer reviewed
Tags :
CÉCI : Consortium des Équipements de Calcul Intensif
Tier-1 supercomputer
Commentary :
10 pages, 22 pages total, 10 figures
Available on ORBi :
since 21 June 2023
