chiamp / muzero-cartpole

Applying DeepMind's MuZero algorithm to the cart pole environment in gym
20Updated last year

Related projects: