Paper published in a book (Scientific congresses and symposiums)
Uncertainty-Aware Reinforcement Learning Agents for Noisy Environments
Singh, Akash; Ittoo, Ashwin; Vandomme, Elise et al.
2025In 2025 tenth International Conference on Information Technology Trends (ITT)
Peer reviewed Dataset
 

Files


Full Text
camerareadySinghAkashUARL.pdf
Author postprint (666.6 kB) Creative Commons License - Public Domain Dedication
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Uncertainty; Epistemic; Masksembles; Deep Ensemble; Reinforcement Learning; Deep reinforcement learning; Churn
Abstract :
[en] Reinforcement Learning (RL) agents are highly sensitive to noise, particularly consecutive noisy states that destabilize training and can trigger catastrophic forgetting, a phenomenon inherent in real-world data as well. While uncertainty estimation has been widely explored for guiding exploration, its role in stabilizing value updates under noisy conditions remains relatively underexplored. In this work, we introduce MASURE (Masksembles for Stable and Uncertainty- aware Reinforcement Learning Environments), a novel framework that integrates Masksembles-based epistemic uncertainty into Q learning. MASURE employs uncertainty-conscious value updates, leveraging the epistemic uncertainty to stabilize learning in noisy environments. We evaluate MASURE in both popular online RL benchmarks with sustained noise spanning consecutive states and in an offline real-world churn prediction task with inherently noisy features to test training stability. Across both settings, MASURE consistently improves stability and predictive performance, outperforming standard RL agents (DQN, BootstrapDQN) and state-of-the-art UE baselines (SunriseDQN, IVDQN). In noisy online benchmarks, MASURE achieves higher and more stable returns than IVDQN, while in the offline churn prediction task it attains the highest balanced accuracy (64.3%), surpassing DQN (63.5%), BootstrapDQN (63.8%), SunriseDQN (61.9%), and IVDQN (62.0%). Importantly, MASURE achieves these gains with significantly lower computational cost than deep ensembles, making it suitable for large-scale real world applications
Disciplines :
Engineering, computing & technology: Multidisciplinary, general & others
Author, co-author :
Singh, Akash  ;  Université de Liège - ULiège > HEC Liège : UER > UER Opérations : Systèmes d'information de gestion
Ittoo, Ashwin ;  Université de Liège - ULiège > HEC Liège : UER > UER Opérations : Systèmes d'information de gestion
Vandomme, Elise  ;  Université de Liège - ULiège > HEC Liège Research > HEC Liège Research: Business Analytics & Supply Chain Mgmt
Ars, Pierre ;  Ethias Insurance
Language :
English
Title :
Uncertainty-Aware Reinforcement Learning Agents for Noisy Environments
Publication date :
06 November 2025
Event name :
10th International Conference on Information Technology Trends
Event organizer :
Higher college of Technology
Event place :
Dubai, United Arab Emirates
Event date :
from 6 to 7 November 2025
Audience :
International
Main work title :
2025 tenth International Conference on Information Technology Trends (ITT)
Publisher :
IEEE
Peer review/Selection committee :
Peer reviewed
Data Set :
Available on ORBi :
since 14 November 2025

Statistics


Number of views
80 (15 by ULiège)
Number of downloads
111 (2 by ULiège)

OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBi