johan-gras / MuZeroLinks
A structured implementation of MuZero
☆204Updated 3 years ago
Alternatives and similar repositories for MuZero
Users that are interested in MuZero are comparing it to the libraries listed below
Sorting:
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆159Updated 4 years ago
- Pytorch Implementation of MuZero☆353Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 4 years ago
- ☆67Updated 3 years ago
- An environment of the board game Go using OpenAI's Gym API☆174Updated 3 years ago
- A Python interface for reinforcement learning environments☆371Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆117Updated 11 months ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆474Updated last year
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆573Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆201Updated 2 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆258Updated last year
- A grid-world game engine for game AI research☆244Updated last year
- ☆304Updated 6 months ago
- Vectorized interface for reinforcement learning environments☆140Updated 2 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆250Updated 3 years ago
- Real-World RL Benchmark Suite☆356Updated 4 years ago
- Modular framework for Reinforcement Learning in python☆173Updated 2 years ago
- A simple stochastic OpenAI environment for training RL agents☆88Updated 2 years ago
- DeepMind Alchemy task environment: a meta-reinforcement learning benchmark☆199Updated 2 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆217Updated 2 years ago
- OpenAI Gym wrapper for ViZDoom enviroments☆69Updated 3 years ago
- A simple chess environment for openai/gym☆161Updated last year
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.☆680Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆82Updated 6 years ago
- CuLE: A CUDA port of the Atari Learning Environment (ALE)☆238Updated 2 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆902Updated last year
- Code for the paper "Phasic Policy Gradient"☆262Updated 2 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆46Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.