Statistics of Batch mode reinforcement learning based on the synthesis of artificial trajectories
