johan-gras / MuZeroLinks
A structured implementation of MuZero
☆205Updated 3 years ago
Alternatives and similar repositories for MuZero
Users that are interested in MuZero are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of MuZero☆354Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆168Updated 4 years ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 5 years ago
- ☆66Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆177Updated 3 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆119Updated last year
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆579Updated 3 years ago
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.☆689Updated last year
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Updated 2 years ago
- Paired Open-Ended Trailblazer (POET) and Enhanced POET☆259Updated 3 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆491Updated last year
- A Python interface for reinforcement learning environments☆388Updated 3 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆84Updated 6 years ago
- Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.☆921Updated 2 years ago
- DeepMind Alchemy task environment: a meta-reinforcement learning benchmark☆203Updated 2 years ago
- ☆324Updated last year
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆278Updated 5 months ago
- Vectorized interface for reinforcement learning environments☆142Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆151Updated 2 years ago
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping☆301Updated last year
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆85Updated 6 years ago
- Qiita投稿用に作成したAgent57(強化学習)の実装コードです。☆45Updated 2 years ago
- A grid-world game engine for game AI research☆253Updated last year
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆206Updated 5 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆84Updated last year
- A simple stochastic OpenAI environment for training RL agents☆88Updated 2 years ago
- A simple chess environment for openai/gym☆164Updated last year
- CuLE: A CUDA port of the Atari Learning Environment (ALE)☆242Updated 3 years ago
- Keeping track of RL experiments☆165Updated 3 years ago