scascin0 / alphazero
A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing too much.
☆13 Updated 2 years ago
Alternatives and similar repositories for alphazero
Users who are interested in alphazero are comparing it to the libraries listed below.
- JAX implementations of core Deep RL algorithms ☆79 Updated 3 years ago
- Baselines for gymnax ☆66 Updated 2 years ago
- ☆18 Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆41 Updated 2 years ago
- MultiTask Environments for Reinforcement Learning. ☆76 Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆56 Updated 2 years ago
- A collection of RL algorithms written in JAX. ☆98 Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆53 Updated 3 weeks ago
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆113 Updated 9 months ago
- Tabular methods for reinforcement learning ☆38 Updated 4 years ago
- General Modules for JAX ☆66 Updated 2 months ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- ☆51 Updated 2 years ago
- Modular framework for Reinforcement Learning in python ☆173 Updated 2 years ago
- Contains JAX implementation of algorithms for inverse reinforcement learning ☆73 Updated 9 months ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 Updated 3 years ago
- Accelerated minigrid environments with JAX ☆138 Updated 3 weeks ago
- Reinforcement learning in pure JAX. ☆13 Updated 3 months ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a… ☆21 Updated 4 years ago
- ☆101 Updated last year
- impact-driven-exploration ☆131 Updated last year
- ☆28 Updated 2 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper. ☆38 Updated 4 years ago
- Various reinforcement learning algorithms written in Jax + Flax ☆24 Updated last year
- OpenAI Gym wrapper for ViZDoom environments ☆69 Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆36 Updated 2 months ago
- A2C is a special case of PPO! ☆21 Updated 3 years ago
- [NeurIPS 2022] Open source code for reusing prior computational work in RL. ☆95 Updated last year
- A C++ pytorch implementation of MuZero ☆38 Updated last year