Paper published in a book (Scientific congresses and symposiums)
Lipschitz robust control from off-policy trajectories
Fonteneau, Raphaël; Ernst, Damien; Boigelot, Bernard et al.
2014, in Proceedings of the 53rd IEEE Conference on Decision and Control (IEEE CDC 2014)
Peer reviewed
 

Files


Full Text
CDC_SIAM.pdf
Author preprint (358.35 kB)
Details



Abstract :
We study the minmax optimization problem introduced in [Fonteneau et al. (2011), "Towards min max reinforcement learning", Springer CCIS, vol. 129, pp. 61-77] for computing control policies for batch mode reinforcement learning in a deterministic setting with a fixed, finite optimization horizon. First, we state that the $\min$ part of this problem is NP-hard. We then provide two relaxation schemes. The first works by dropping some constraints in order to obtain a problem that is solvable in polynomial time. The second, based on a Lagrangian relaxation where all constraints are dualized, can also be solved in polynomial time. We theoretically show that both relaxation schemes provide better results than those given in [Fonteneau et al. (2011)].
Disciplines :
Computer science
Author, co-author :
Fonteneau, Raphaël ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Ernst, Damien  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Smart grids
Boigelot, Bernard  ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Informatique
Louveaux, Quentin ;  Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Système et modélisation : Optimisation discrète
Language :
English
Title :
Lipschitz robust control from off-policy trajectories
Publication date :
2014
Event name :
53rd IEEE Conference on Decision and Control (IEEE CDC 2014)
Event place :
Los Angeles, United States
Event date :
December 15-17, 2014
Audience :
International
Main work title :
Proceedings of the 53rd IEEE Conference on Decision and Control (IEEE CDC 2014)
Peer reviewed :
Peer reviewed
Available on ORBi :
since 14 October 2014

Statistics


Number of views
124 (15 by ULiège)
Number of downloads
254 (12 by ULiège)

Scopus citations® :
0
Scopus citations® without self-citations :
0
