Statistics of Batch Mode Reinforcement Learning based on the Synthesis of Artificial Trajectories

Contact ORBi