tmoer / alphazero_singleplayer
Single-player AlphaZero implementation
☆42 · Updated 2 years ago
Alternatives and similar repositories for alphazero_singleplayer:
Users who are interested in alphazero_singleplayer compare it to the libraries listed below.
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆113 · Updated 6 months ago
- Scalable Implementation of Neural Fictitious Self-Play ☆74 · Updated 5 years ago
- A simple implementation of MuZero algorithm for connect4 game ☆97 · Updated 4 years ago
- ☆66 · Updated 3 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆69 · Updated last month
- A structured implementation of MuZero ☆207 · Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆155 · Updated 3 years ago
- An environment of the board game Go using OpenAI's Gym API ☆168 · Updated 2 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆35 · Updated 3 years ago
- PyTorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser… ☆60 · Updated last year
- The source code for the gym-microrts paper. ☆42 · Updated 2 years ago
- A framework for easy prototyping of distributed reinforcement learning algorithms ☆95 · Updated 4 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆203 · Updated last year
- Reinforcement Learning Assembly ☆92 · Updated 3 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies ☆120 · Updated 9 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆115 · Updated 3 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆43 · Updated 2 years ago
- A Python implementation of tabular MuZero for educational purposes ☆21 · Updated 5 years ago
- Modular framework for Reinforcement Learning in Python ☆170 · Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) in JAX. ☆29 · Updated 5 months ago
- ☆49 · Updated last year
- Reproducing results from DeepMind's paper on Population Based Training of Neural Networks. ☆56 · Updated 6 years ago
- AlphaZero in JAX ☆73 · Updated 9 months ago
- A Repository with C++ implementations of Reinforcement Learning Algorithms (PyTorch) ☆92 · Updated 5 years ago
- PyTorch Implementation of MuZero ☆347 · Updated last year
- ReconChess python implementation ☆42 · Updated 2 years ago
- ☆293 · Updated last month
- Map-Elites based on Evolution Strategies ☆31 · Updated 2 years ago
- A standalone release of DeepMind Lab's maze generator with Python bindings. ☆60 · Updated last year
- Clone of OpenAI's Spinning Up in PyTorch ☆146 · Updated 2 years ago