bhansconnect / alphazero-pybind11
A modified Alphazero implementation with C++ where performance matters.
☆17 · Updated last month
Alternatives and similar repositories for alphazero-pybind11
Users who are interested in alphazero-pybind11 are comparing it to the libraries listed below:
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆79 · Updated 8 months ago
- ☆89 · Updated 7 months ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆120 · Updated 4 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆44 · Updated 2 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆84 · Updated 2 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆160 · Updated 4 years ago
- SpielViz is an interactive viewer for OpenSpiel games. ☆34 · Updated last year
- Reproduction of AlphaTensor paper for 2x2 matrices ☆17 · Updated last year
- Implementation of DeepMind's AlphaZero algorithm with Caffe and C++ ☆19 · Updated 7 years ago
- A structured implementation of MuZero ☆205 · Updated 3 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆83 · Updated 6 years ago
- PyTorch Implementation of MuZero ☆353 · Updated 2 years ago
- ☆67 · Updated 3 years ago
- AlphaZero in JAX ☆78 · Updated last year
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste… ☆90 · Updated 7 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning" ☆174 · Updated 2 years ago
- AlphaZero implementation on Gomoku ☆18 · Updated 6 months ago
- A simple implementation of MuZero algorithm for connect4 game ☆96 · Updated 5 years ago
- MiniZero: An AlphaZero and MuZero Training Framework ☆98 · Updated last month
- CuLE: A CUDA port of the Atari Learning Environment (ALE) ☆241 · Updated 2 years ago
- An environment of the board game Go using OpenAI's Gym API ☆175 · Updated 3 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 · Updated 4 years ago
- Reinforcement Learning Assembly ☆92 · Updated 4 years ago
- Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021 ☆39 · Updated 2 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 · Updated 4 years ago
- Parallel Monte Carlo Tree Search, see README.md for more detailed usage and information. ☆49 · Updated 4 years ago
- Generative Neuro-Symbolic (GNS) Modeling (Feinman & Lake, 2021) ☆27 · Updated 4 years ago
- A curated list of papers related to program synthesis, program induction, program execution, program and code repair, and programmatic re… ☆168 · Updated 3 years ago
- Levin tree search guided by both a policy and a heuristic function ☆19 · Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it ☆128 · Updated 2 years ago