This paper reports positive simulation results obtained when raw image pixels are used directly as the input state of a reinforcement learning algorithm. The algorithm chosen for the simulations is a batch-mode reinforcement learning algorithm known as fitted Q iteration.
Disciplines :
Computer science
Author, co-author :
Ernst, Damien ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Marée, Raphaël ; Université de Liège - ULiège > Systèmes et modélisation
Wehenkel, Louis ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Language :
English
Title :
Reinforcement learning with raw image pixels as input state
Publication date :
2006
Event name :
International Workshop on Intelligent Computing in Pattern Analysis/Synthesis (IWICPAS)
Event place :
Xi'an, China
Event date :
August 26-27, 2006
Audience :
International
Main work title :
Advances in machine vision, image processing & pattern analysis (Lecture notes in computer science, Vol. 4153)