![]() ![]() | Castronovo, M. (2017). Offline Policy-search in Bayesian Reinforcement Learning [Doctoral thesis, ULiège - Université de Liège]. ORBi-University of Liège. https://orbi.uliege.be/handle/2268/208421 |
![]() ![]() | Castronovo, M., François-Lavet, V., Fonteneau, R., Ernst, D., & Couëtoux, A. (2017). Approximate Bayes Optimal Policy Search using Neural Networks. In Proceedings of the 9th International Conference on Agents and Artificial Intelligence (ICAART 2017). doi:10.5220/0006191701420153 ![]() |
![]() ![]() | Castronovo, M., Ernst, D., Couëtoux, A., & Fonteneau, R. (2016). Benchmarking for Bayesian Reinforcement Learning. PLoS ONE. doi:10.1371/journal.pone.0157088 ![]() |
![]() ![]() | Castronovo, M., Ernst, D., & Fonteneau, R. (2014). Bayes Adaptive Reinforcement Learning versus Off-line Prior-based Policy Search: an Empirical Comparison. In Proceedings of the 23rd annual machine learning conference of Belgium and the Netherlands (BENELEARN 2014). ![]() |
![]() ![]() | Castronovo, M., Ernst, D., & Fonteneau, R. (2014). Apprentissage par renforcement bayésien versus recherche directe de politique hors-ligne en utilisant une distribution a priori: comparaison empirique. In Proceedings des 9èmes Journée Francophones de Planification, Décision et Apprentissage. ![]() |
![]() ![]() | Castronovo, M. (2012). Learning for exploration/exploitation in reinforcement learning [Master’s dissertation, ULiège - Université de Liège]. ORBi-University of Liège. https://orbi.uliege.be/handle/2268/131885 |
![]() ![]() | Castronovo, M., Maes, F., Fonteneau, R., & Ernst, D. (2012). Learning exploration/exploitation strategies for single trajectory reinforcement learning. In Proceedings of the 10th European Workshop on Reinforcement Learning (EWRL 2012) (pp. 1-9). ![]() |