We consider the problem of decision making in the context of unknown Markov decision processes with finite state and action spaces. In a Bayesian reinforcement learning framework, we propose an optimistic posterior sampling strategy based on the maximization of state-action value functions of MDPs sampled from the posterior. Preliminary experiments are promising.
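The strategy described in the abstract can be sketched as follows: maintain a posterior over MDP models, draw several MDPs from it, solve each one, and act greedily with respect to the element-wise maximum of the sampled state-action value functions. The sketch below is an illustration only, not the paper's implementation; the Dirichlet posterior, known rewards, the sample count, and all dimensions are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

n_states, n_actions, n_samples, gamma = 3, 2, 5, 0.95

# Dirichlet posterior over transition probabilities: one pseudo-count
# vector per (state, action) pair (a uniform prior, for illustration).
alpha = np.ones((n_states, n_actions, n_states))
# Rewards are assumed known here to keep the sketch short.
R = rng.random((n_states, n_actions))

def q_value_iteration(P, R, gamma, n_iters=200):
    """Compute the optimal Q-function of one sampled MDP by value iteration."""
    Q = np.zeros_like(R)
    for _ in range(n_iters):
        V = Q.max(axis=1)                          # greedy state values
        Q = R + gamma * np.einsum('sap,p->sa', P, V)
    return Q

# Optimistic posterior sampling: draw several MDPs from the posterior,
# solve each, and keep the element-wise maximum of their Q-functions.
Q_samples = []
for _ in range(n_samples):
    P = np.array([[rng.dirichlet(alpha[s, a]) for a in range(n_actions)]
                  for s in range(n_states)])       # sampled transition model
    Q_samples.append(q_value_iteration(P, R, gamma))
Q_opt = np.max(Q_samples, axis=0)

# Act greedily with respect to the optimistic Q-function.
policy = Q_opt.argmax(axis=1)
```

In a full agent, the chosen action's outcome would then update the posterior counts before the next round of sampling.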
Disciplines :
Computer science
Author, co-author :
Fonteneau, Raphaël ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Korda, Nathan; University of Oxford, England
Munos, Rémi; Inria Lille - Nord Europe
Language :
English
Title :
An Optimistic Posterior Sampling Strategy for Bayesian Reinforcement Learning
Publication date :
2013
Event name :
NIPS 2013 Workshop on Bayesian Optimization (BayesOpt2013)
Event date :
10 December 2013
Main work title :
NIPS 2013 Workshop on Bayesian Optimization (BayesOpt2013)