Informed POMDP: Leveraging Additional Information in Model-Based RL

[en] In this work, we generalize the problem of learning through interaction in a POMDP by accounting for eventual additional information available at training time. First, we introduce the informed POMDP, a new learning paradigm offering a clear distinction between the information at training and the observation at execution. Next, we propose an objective that leverages this information for learning a sufficient statistic of the history for the optimal control. We then adapt this informed objective to learn a world model able to sample latent trajectories. Finally, we empirically show a learning speed improvement in several environments using this informed world model in the Dreamer algorithm. These results and the simplicity of the proposed adaptation advocate for a systematic consideration of eventual additional information when learning in a POMDP using model-based RL.

Disciplines :

Computer science

Author, co-author :

Lambrechts, Gaspard ; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids

Bolland, Adrien ; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids

Ernst, Damien ; Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids

Language :

English

Title :

Informed POMDP: Leveraging Additional Information in Model-Based RL

Publication date :

August 2024

Event name :

Reinforcement Learning Conference

Event place :

Amherst, United States - Massachusetts

Event date :

August 9th, 2024

Audience :

International

Peer reviewed :

Peer reviewed

Source :

https://openreview.net/forum?id=3e1TkSoVXb

Tags :

CÉCI : Consortium des Équipements de Calcul Intensif
Tier-1 supercomputer

Additional URL :

https://arxiv.org/abs/2306.11488

Commentary :

10 pages, 22 pages total, 10 figures

Available on ORBi :

since 21 June 2023

Statistics

Number of views

178 (93 by ULiège)

Number of downloads

57 (21 by ULiège)

More statistics

Bibliography

Similar publications

Sorry the service is unavailable at the moment. Please try again later.

Name	Provider / Domaine	Expiration	Description
JSESSIONID	Oracle Corporation www.uliege.be	Session	General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.
CookieScriptConsent	CookieScript .uliege.be	1 year	This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.

Name	Provider / Domaine	Expiration	Description
_pk_id	InnoCraft Ltd .uliege.be	1 year	Used to store a few details about the user such as the unique visitor ID
_pk_ses	InnoCraft Ltd .uliege.be	30 minutes	Short lived cookies used to temporarily store data for the visit
_pk_ref	InnoCraft Ltd .uliege.be	6 months	Used to store the attribution information, the referrer initially used to visit the website