Zeta36 / muzero
A simple implementation of the MuZero algorithm for the Connect 4 game
☆97 Updated 4 years ago
Alternatives and similar repositories for muzero:
Users who are interested in muzero are comparing it to the libraries listed below:
- A structured implementation of MuZero ☆207 Updated 2 years ago
- ☆67 Updated 3 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆158 Updated 4 years ago
- PyTorch Implementation of MuZero ☆351 Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆115 Updated 8 months ago
- A Python implementation of tabular MuZero for educational purposes ☆21 Updated 5 years ago
- A PyTorch implementation of DeepMind's MuZero agent ☆33 Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆198 Updated 2 years ago
- ☆51 Updated 2 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 Updated 4 years ago
- OpenAI Gym wrapper for ViZDoom environments ☆69 Updated 3 years ago
- AlphaZero in JAX ☆77 Updated last year
- Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd ☆91 Updated 2 years ago
- DeepMind Alchemy task environment: a meta-reinforcement learning benchmark ☆198 Updated 2 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 Updated 4 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆21 Updated last year
- Scalable Implementation of Neural Fictitious Self-Play ☆77 Updated 6 years ago
- An environment of the board game Go using OpenAI's Gym API ☆173 Updated 2 years ago
- The source code for the gym-microrts paper. ☆42 Updated 2 years ago
- impact-driven-exploration ☆130 Updated last year
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆31 Updated 2 years ago
- The submission template for the MineRL Competition @ NeurIPS 2021. Clone this to make a new submission! ☆93 Updated 3 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆83 Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. ☆79 Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments. ☆58 Updated 5 months ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆73 Updated 4 months ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆147 Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆44 Updated 2 years ago
- PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆64 Updated last year
- ☆299 Updated 3 months ago