scascin0 / alphazero
A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing too much.
☆13 Updated 2 years ago
Alternatives and similar repositories for alphazero
Users who are interested in alphazero are comparing it to the libraries listed below.
- Tabular methods for reinforcement learning ☆38 Updated 5 years ago
- Car racing RL agents in actual F1 tracks ☆13 Updated 9 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆41 Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ☆58 Updated 3 years ago
- Baselines for gymnax 🤖 ☆71 Updated 2 years ago
- Reinforcement learning in pure JAX. ☆13 Updated 5 months ago
- ☆28 Updated 3 years ago
- An implementation of AlphaZero and MCTS with neural networks for Tetris ☆21 Updated 4 months ago
- ☆102 Updated last year
- ☆18 Updated last year
- MultiTask Environments for Reinforcement Learning. ☆76 Updated 2 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 Updated 11 months ago
- Neuro-evolution for OpenAI Gym environments ☆57 Updated 4 years ago
- Modular framework for Reinforcement Learning in python ☆174 Updated 2 years ago
- JAX implementations of core Deep RL algorithms ☆81 Updated 3 years ago
- Reinforcement learning library in JAX. ☆100 Updated last year
- Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905 ☆33 Updated 3 years ago
- Vectorization techniques for fast population-based training. ☆56 Updated 3 years ago
- megastep helps you build 1-million FPS reinforcement learning environments on a single GPU ☆140 Updated 3 years ago
- General Modules for JAX ☆67 Updated 4 months ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses". ☆87 Updated last year
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 4 years ago
- PyTorch code to train and evaluate Procgen tasks ☆25 Updated 4 years ago
- A practical step-by-step guide to applying RUDDER ☆35 Updated 5 years ago
- impact-driven-exploration ☆131 Updated last year
- Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd ☆91 Updated 2 years ago
- ☆84 Updated 4 years ago
- Evolution Strategy Library ☆55 Updated 5 years ago
- RL agent to play μRTS with Stable-Baselines3 and PyTorch ☆26 Updated 3 years ago