johan-gras / MuZero
A structured implementation of MuZero
☆206Updated 3 years ago
Alternatives and similar repositories for MuZero
Users who are interested in MuZero are comparing it to the libraries listed below
- A simple implementation of the MuZero algorithm for the Connect4 game☆96Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆166Updated 4 years ago
- PyTorch implementation of MuZero☆353Updated 2 years ago
- ☆66Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆176Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆119Updated last year
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆490Updated last year
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆578Updated 3 years ago
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.☆683Updated last year
- Vectorized interface for reinforcement learning environments☆142Updated 2 years ago
- A Python interface for reinforcement learning environments☆387Updated 2 years ago
- Scalable implementation of Neural Fictitious Self-Play☆84Updated 6 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆275Updated 3 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆206Updated 3 years ago
- ☆320Updated 11 months ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆916Updated last year
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆300Updated last year
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch.☆82Updated 11 months ago
- A simple chess environment for openai/gym☆163Updated last year
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Updated 2 years ago
- DeepMind Alchemy task environment: a meta-reinforcement learning benchmark☆202Updated 2 years ago
- A simple stochastic OpenAI environment for training RL agents☆88Updated 2 years ago
- OpenAI Gym wrapper for ViZDoom environments☆70Updated 4 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆46Updated 2 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆257Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- My implementation of the Proximal Policy Optimization algorithm using Keras as a backend☆88Updated 6 years ago
- CuLE: A CUDA port of the Atari Learning Environment (ALE)☆241Updated 3 years ago
- Code for the paper "Exploration by Random Network Distillation"☆924Updated 5 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆127Updated last year