Scientific conference in universities or research centers (Scientific conferences in universities or research centers)
Presentation: Behind the Myth of Exploration in Policy Gradients
Bolland, Adrien
2024
 

Files


Full Text
Behind the Myth of Exploration in Policy Gradients.pdf
Author preprint (992.15 kB) Creative Commons License - Attribution, Non-Commercial, ShareAlike
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
reinforcement learning; exploration; policy gradient
Abstract :
[en] Policy-gradient algorithms are effective reinforcement learning methods for solving control problems with continuous state and action spaces. To compute near-optimal policies, it is essential in practice to include exploration terms in the learning objective. Although the effectiveness of these terms is usually justified by an intrinsic need to explore environments, we propose a novel analysis and distinguish two different implications of these techniques. First, they make it possible to smooth the learning objective and to eliminate local optima while preserving the global maximum. Second, they modify the gradient estimates, increasing the probability that the stochastic parameter update eventually provides an optimal policy. In light of these effects, we discuss and illustrate empirically exploration strategies based on entropy bonuses, highlighting their limitations and opening avenues for future works in the design and analysis of such strategies.
Disciplines :
Computer science
Author, co-author :
Bolland, Adrien ;  Université de Liège - ULiège > Département d'électricité, électronique et informatique (Institut Montefiore) > Smart grids
Language :
English
Title :
Presentation: Behind the Myth of Exploration in Policy Gradients
Publication date :
19 February 2024
Event name :
Machine Learning and AI academy
Event organizer :
Haitham Bou Ammar
Event date :
January 19th, 2024
Audience :
International
Funders :
F.R.S.-FNRS - Fund for Scientific Research [BE]
Commentary :
Link to the talk : https://www.youtube.com/watch?v=nqp1LUaFecQ
Available on ORBi :
since 15 February 2024

Statistics


Number of views
46 (11 by ULiège)
Number of downloads
15 (4 by ULiège)

Bibliography


Similar publications



Contact ORBi