chiamp / muzero-cartpole
Applying DeepMind's MuZero algorithm to the cart pole environment in gym
☆21Updated last year
Alternatives and similar repositories for muzero-cartpole:
Users that are interested in muzero-cartpole are comparing it to the libraries listed below
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- ☆28Updated 2 years ago
- A collection of RL algorithms written in JAX.☆96Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆50Updated last year
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆34Updated 2 weeks ago
- Revisiting Rainbow☆74Updated 3 years ago
- ☆18Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆40Updated 2 years ago
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- A collection of meta-learning algorithms in Jax☆22Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- General Modules for JAX☆64Updated last month
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- A2C is a special case of PPO!☆19Updated 2 years ago
- ☆50Updated last year
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch.☆50Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- ☆74Updated last week
- Vectorization techniques for fast population-based training.☆55Updated 2 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆34Updated 5 years ago
- Episodic Control☆19Updated 2 years ago
- PyTorch implementation of the Munchausen Reinforcement Learning Algorithms M-DQN and M-IQN☆45Updated 4 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Updated 3 years ago
- ☆74Updated 4 months ago