Paper published in a book (Scientific congresses and symposiums)
Policy search with cross-entropy optimization of basis functions
Busoniu, Lucian; Ernst, Damien; De Schutter, Bart et al.
2009. In Proceedings of the IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-09)
Peer reviewed
 

Files


Full Text
adprl-ceps.pdf
Publisher postprint (303.98 kB)

All documents in ORBi are protected by a user license.

Details



Keywords :
reinforcement learning; direct policy search; cross-entropy optimization
Abstract :
This paper introduces a novel algorithm for approximate policy search in continuous-state, discrete-action Markov decision processes (MDPs). Previous policy search approaches have typically used ad-hoc parameterizations developed for specific MDPs. In contrast, the novel algorithm employs a flexible policy parameterization, suitable for solving general discrete-action MDPs. The algorithm looks for the best closed-loop policy that can be represented using a given number of basis functions, where a discrete action is assigned to each basis function. The locations and shapes of the basis functions are optimized, together with the action assignments. This allows a large class of policies to be represented. The optimization is carried out with the cross-entropy method and evaluates the policies by their empirical return from a representative set of initial states. We report simulation experiments in which the algorithm reliably obtains good policies with only a small number of basis functions, albeit at sizable computational costs.
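The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy 1-D problem, the Gaussian basis functions, the Gaussian and Bernoulli sampling densities, and every name and constant below (`ACTIONS`, `X0_SET`, `ce_policy_search`, the sample sizes) are assumptions chosen for illustration. The sketch samples basis-function centers, widths, and action assignments, scores each candidate policy by its average discounted return from a small set of initial states, and refits the sampling density to the best-scoring samples.

```python
import math
import random

ACTIONS = [-1.0, 1.0]              # toy discrete action set (assumption)
X0_SET = [-0.8, -0.4, 0.4, 0.8]    # representative initial states (assumption)
GAMMA, HORIZON = 0.95, 30          # discount factor and rollout length (assumption)

def policy(x, centers, widths, assign):
    """Return the action attached to the basis function most active at x."""
    acts = [math.exp(-((x - c) / b) ** 2) for c, b in zip(centers, widths)]
    return ACTIONS[assign[acts.index(max(acts))]]

def step(x, u):
    """Toy 1-D dynamics: move by 0.1*u, clip to [-1, 1], reward -|x'|."""
    x_next = max(-1.0, min(1.0, x + 0.1 * u))
    return x_next, -abs(x_next)

def score(centers, widths, assign):
    """Average discounted return over the representative initial states."""
    total = 0.0
    for x in X0_SET:
        for k in range(HORIZON):
            x, r = step(x, policy(x, centers, widths, assign))
            total += (GAMMA ** k) * r
    return total / len(X0_SET)

def ce_policy_search(n_bf=2, n_samples=50, elite_frac=0.2, n_iter=20, seed=0):
    """Cross-entropy search over centers, widths, and action assignments."""
    rng = random.Random(seed)
    # Sampling density: Gaussians over centers and widths, Bernoulli over actions.
    mu_c, sd_c = [0.0] * n_bf, [1.0] * n_bf
    mu_b, sd_b = [0.5] * n_bf, [0.3] * n_bf
    p_a = [0.5] * n_bf             # probability that basis function i gets ACTIONS[1]
    n_elite = max(2, int(elite_frac * n_samples))
    best_score, best_params = -math.inf, None
    for _ in range(n_iter):
        samples = []
        for _ in range(n_samples):
            c = [rng.gauss(m, s) for m, s in zip(mu_c, sd_c)]
            b = [max(1e-2, abs(rng.gauss(m, s))) for m, s in zip(mu_b, sd_b)]
            a = [1 if rng.random() < p else 0 for p in p_a]
            samples.append((score(c, b, a), c, b, a))
        samples.sort(key=lambda s: s[0], reverse=True)
        elite = samples[:n_elite]
        if elite[0][0] > best_score:
            best_score, best_params = elite[0][0], elite[0][1:]
        # Refit each density parameter to the elite samples.
        for i in range(n_bf):
            cs = [e[1][i] for e in elite]
            bs = [e[2][i] for e in elite]
            mu_c[i] = sum(cs) / n_elite
            mu_b[i] = sum(bs) / n_elite
            sd_c[i] = math.sqrt(sum((v - mu_c[i]) ** 2 for v in cs) / n_elite) + 1e-3
            sd_b[i] = math.sqrt(sum((v - mu_b[i]) ** 2 for v in bs) / n_elite) + 1e-3
            p_a[i] = sum(e[3][i] for e in elite) / n_elite
    return best_score, best_params

if __name__ == "__main__":
    found_score, (centers, widths, assign) = ce_policy_search()
    print("best average return:", found_score)
    print("centers:", centers, "widths:", widths, "actions:", assign)
```

Every candidate policy requires a full set of simulated rollouts, so the cost grows with the number of samples, iterations, and initial states. This is consistent with the abstract's remark about sizable computational costs, although the toy setting here says nothing about the paper's actual experiments.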
Disciplines :
Computer science
Author, co-author :
Busoniu, Lucian
Ernst, Damien; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
De Schutter, Bart
Babuska, Robert
Language :
English
Title :
Policy search with cross-entropy optimization of basis functions
Publication date :
2009
Event name :
IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-09)
Event place :
Nashville, United States
Event date :
March 30 - April 2, 2009
Audience :
International
Main work title :
Proceedings of the IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-09)
ISBN/EAN :
978-1-4244-2761-1
Pages :
153-160
Peer reviewed :
Peer reviewed
Funders :
F.R.S.-FNRS - Fonds de la Recherche Scientifique [BE]
Available on ORBi :
since 03 June 2009
