bhansconnect / alphazero-pybind11
A modified AlphaZero implementation that uses C++ where performance matters.
☆17 Updated last year
Related projects
Alternatives and complementary repositories for alphazero-pybind11
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆41 Updated last year
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆65 Updated last year
- SpielViz is an interactive viewer for OpenSpiel games. ☆28 Updated 5 months ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆82 Updated last year
- Implementation of DeepMind's AlphaZero algorithm with Caffe and C++ ☆19 Updated 6 years ago
- AlphaZero in JAX ☆69 Updated 7 months ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆156 Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆110 Updated 3 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste… ☆88 Updated 6 years ago
- A simple implementation of the MuZero algorithm for the game Connect 4 ☆95 Updated 4 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆73 Updated 5 years ago
- OpenAI Gym environments for Legends of Code and Magic, a collectible card game designed for AI research ☆35 Updated last week
- ☆65 Updated 3 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game ☆39 Updated last year
- ☆12 Updated 2 years ago
- ☆80 Updated last month
- Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games ☆46 Updated 2 months ago
- MiniZero: An AlphaZero and MuZero Training Framework ☆72 Updated 3 weeks ago
- PyTorch Implementation of MuZero ☆343 Updated last year
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 Updated 3 years ago
- A structured implementation of MuZero ☆206 Updated 2 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games ☆36 Updated 3 years ago
- ☆11 Updated 2 years ago
- fast + parallel AlphaZero in JAX ☆84 Updated 7 months ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021 ☆45 Updated 2 years ago
- Classic MCTS example with mctx ☆15 Updated last year
- Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021 ☆33 Updated 2 years ago
- Single-player AlphaZero implementation ☆40 Updated 2 years ago
- ☆48 Updated last year
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works ☆16 Updated 3 years ago