Paper published in a journal (Scientific congresses and symposiums)
Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access
Ebi, Daniel; Ernst, Damien; Böhm, Klemens et al.
2026In Proceedings of Machine Learning Research
Peer Reviewed verified by ORBi
 

Files


Full Text
informed-critic-icml.pdf
Author postprint (3.84 MB)
Download

All documents in ORBi are protected by a user license.

Send to



Details



Keywords :
Asymmetric Reinforcement Learning; Asymmetric Actor-Critic; Partial Observability; POMDP; Privileged Information; Informativeness
Abstract :
[en] Asymmetric reinforcement learning leverages privileged information available during training to improve learning under partial observability. Existing asymmetric actor-critic methods typically assume access to the full environment state to condition the critic during training, which is often unrealistic in practice. We introduce the informed asymmetric actor-critic framework that allows the critic to be conditioned on arbitrary state-dependent privileged signals, and show that any such signal yields unbiased policy gradient estimates. This substantially expands the set of admissible privileged information and raises the problem of selecting the most informative signals for learning. To this end, we propose two novel informativeness criteria: a dependence-based test that can be applied prior to training, and a test based on improvements in value prediction that can be applied post hoc. Experiments on partially observable benchmarks and synthetic environments demonstrate that carefully selected privileged signals can match or outperform full-state asymmetric baselines while relying on strictly less state information.
Disciplines :
Computer science
Author, co-author :
Ebi, Daniel;  KIT - Karlsruhe Institute of Technology > Department of Computer Science
Ernst, Damien  ;  Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science
Böhm, Klemens;  KIT - Karlsruhe Institute of Technology > Department of Computer Science
Lambrechts, Gaspard ;  McGill University > Department of Electrical and Computer Engineering ; Mila - Québec Artificial Intelligence Institute
Language :
English
Title :
Informed Asymmetric Actor-Critic: Leveraging Privileged Signals Beyond Full-State Access
Publication date :
July 2026
Event name :
International Conference on Machine Learning
Event place :
Seoul, South Korea
Event date :
July 6th, 2026
Audience :
International
Journal title :
Proceedings of Machine Learning Research
eISSN :
2640-3498
Publisher :
PMLR
Peer review/Selection committee :
Peer Reviewed verified by ORBi
Funders :
F.R.S.-FNRS - Fund for Scientific Research
Available on ORBi :
since 09 June 2026

Statistics


Number of views
8 (3 by ULiège)
Number of downloads
6 (0 by ULiège)

Bibliography


Similar publications



Contact ORBi