HIV infection dynamics; drug-scheduling strategies; reinforcement learning; fitted Q iteration
Abstract :
[en] This paper addresses the problem of computing optimal structured treatment interruption strategies for HIV infected patients. We show that reinforcement learning may be useful to extract such strategies directly from clinical data, without the need of an accurate mathematical model of HIV infection dynamics. To support our claims, we report simulation results obtained by running a recently proposed batch-mode reinforcement learning algorithm, known as fitted Q iteration, on numerically generated data.
Disciplines :
Computer science Immunology & infectious disease
Author, co-author :
Ernst, Damien ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Stan, Guy-Bart; University of Cambridge > Department of Engineering > Control Group
Gonçalves, Jorge; University of Cambridge > Department of Engineering > Control Group
Wehenkel, Louis ; Université de Liège - ULiège > Dép. d'électric., électron. et informat. (Inst.Montefiore) > Systèmes et modélisation
Language :
English
Title :
Clinical data based optimal STI strategies for HIV: a reinforcement learning approach
Publication date :
2006
Event name :
45th IEEE Conference on Decision and Control (CDC 2006)
Event place :
San Diego, United States
Event date :
13-15 December 2006
Audience :
International
Main work title :
Proceedings of the 45th IEEE Conference on Decision and Control (CDC 2006)
scite shows how a scientific paper has been cited by providing the context of the citation, a classification describing whether it supports, mentions, or contrasts the cited claim, and a label indicating in which section the citation was made.
Bibliography
Adams, B., Banks, H., Kwon, H.-D., & Tran, H. (2004). Dynamic multidrug therapies for HIV: Optimal and STI control approaches. Mathematical Biosciences and Engineering, 1, 223-241.
Bajaria, S., Webb, G., & Kirschner, D. (2004). Predicting differential responses to structured treatment interruptions during HAART. Bull. Math. Biol., 66, 1093-1118.
Bertsekas, D. (2000). Dynamic Programming and Optimal Control, vol. I. Belmont, MA: Athena Scientific. 2nd edition.
Bitmead, R., Gevers, M., & Werts, V. (1990). Adaptive Optimal Control: The Thinking Man's GPC. Prentice Hall International.
Bonhoeffer, S., Rembiszewski, M., Ortiz, G., & Nixon, D. (2000). Risks and benefits of structured antiretroviral drug therapy interruptions in HIV-I infection. AIDS, 14, 2313-2322.
Ernst, D., Geurts, P., & Wehenkel, L. (2005). Tree-based batch mode reinforcement learning. Journal of Machine Learning Research, 6, 503-556.
Geurts, P., Ernst, D., & Wehenkel, L. (2006). Extremely randomized trees. Machine Learning, 36, 3-42.
Lisziewicz, J., Rosenberg, E., & Liebermann, J. (1999). Control of HIV despite the discontinuation of anti-retroviral therapy. New England J. Med., 340, 1683-1684.
Lisziewicz, J., Rosenberr, E., & Liebermann, J. (2000). Structured treatment interruptions to control HIV-I infection. The Lancet, 354, 287-288.
Murphy, K. (2000). A survey of POMDP solution techniques (Technical Report). University of California at Berkeley.
Sutton, R., & Barto, A. (1998). Reinforcement Learning, an Introduction. MIT Press.
Similar publications
Sorry the service is unavailable at the moment. Please try again later.
This website uses cookies to improve user experience. Read more
Save & Close
Accept all
Decline all
Show detailsHide details
Cookie declaration
About cookies
Strictly necessary
Performance
Strictly necessary cookies allow core website functionality such as user login and account management. The website cannot be used properly without strictly necessary cookies.
This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.
Performance cookies are used to see how visitors use the website, eg. analytics cookies. Those cookies cannot be used to directly identify a certain visitor.
Used to store the attribution information, the referrer initially used to visit the website
Cookies are small text files that are placed on your computer by websites that you visit. Websites use cookies to help users navigate efficiently and perform certain functions. Cookies that are required for the website to operate properly are allowed to be set without your permission. All other cookies need to be approved before they can be set in the browser.
You can change your consent to cookie usage at any time on our Privacy Policy page.