Zeta36 / muzero
A simple implementation of the MuZero algorithm for the Connect 4 game
☆97 Updated 4 years ago
Alternatives and similar repositories for muzero:
Users who are interested in muzero are comparing it to the repositories listed below:
- A structured implementation of MuZero ☆207 Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆157 Updated 3 years ago
- ☆67 Updated 3 years ago
- PyTorch Implementation of MuZero ☆349 Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆114 Updated 7 months ago
- A Python implementation of tabular MuZero for educational purposes ☆21 Updated 5 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 Updated 4 years ago
- ☆298 Updated 3 months ago
- impact-driven-exploration ☆129 Updated last year
- ☆50 Updated last year
- Scalable Implementation of Neural Fictitious Self-Play ☆75 Updated 6 years ago
- ☆100 Updated last year
- An environment of the board game Go using OpenAI's Gym API ☆174 Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it ☆128 Updated last year
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber. ☆77 Updated 4 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆171 Updated 7 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆196 Updated 2 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆146 Updated last year
- The source code for the gym-microrts paper. ☆42 Updated 2 years ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆30 Updated 2 years ago
- JAX implementations of core Deep RL algorithms ☆79 Updated 2 years ago
- Code for the paper "Phasic Policy Gradient" ☆259 Updated last year
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions ☆147 Updated 3 years ago
- Keeping track of RL experiments ☆162 Updated 2 years ago
- A grid-world game engine for game AI research ☆239 Updated 11 months ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆83 Updated 5 years ago
- FQF (Fully parameterized Quantile Function for distributional reinforcement learning) is a general reinforcement learning framework for At… ☆42 Updated 4 years ago
- DeepMind Alchemy task environment: a meta-reinforcement learning benchmark ☆200 Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆116 Updated 3 years ago