chiamp / muzero-cartpole
Applying DeepMind's MuZero algorithm to the cart pole environment in gym
☆21Updated last year
Alternatives and similar repositories for muzero-cartpole:
Users that are interested in muzero-cartpole are comparing it to the libraries listed below
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆54Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- A2C is a special case of PPO!☆20Updated 2 years ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆79Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆83Updated 2 years ago
- A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxi…☆71Updated 4 years ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆46Updated 4 years ago
- ☆51Updated 2 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆21Updated last year
- ☆101Updated last year
- Deep Reinforcement Learning Framework done with PyTorch☆35Updated last month
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- Revisiting Rainbow☆74Updated 3 years ago
- A collection of RL algorithms written in JAX.☆97Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning☆72Updated 8 months ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 4 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆96Updated last year
- Reinforcement Learning with Latent Flow☆43Updated 4 years ago
- AlphaZero for continuous control tasks☆23Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Implicit Normalizing Flows + Reinforcement Learning☆61Updated 5 years ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆52Updated last year
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆82Updated last year
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated 2 years ago
- Neuro-evolution for OpenAI Gym environments☆56Updated 4 years ago
- unofficial code reproducing Agent57☆36Updated last year
- ☆16Updated 3 years ago