ImparaAI / monte-carlo-tree-search
Library for running a Monte Carlo tree search, either traditionally or with expert policies
☆126 · Updated last year
Alternatives and similar repositories for monte-carlo-tree-search
Users who are interested in monte-carlo-tree-search are comparing it to the libraries listed below.
- A simple implementation of MuZero algorithm for connect4 game ☆96 · Updated 4 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆120 · Updated 4 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆117 · Updated 11 months ago
- ☆67 · Updated 3 years ago
- A structured implementation of MuZero ☆204 · Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations. ☆148 · Updated 2 years ago
- Demo of UCT (MCTS) in Python / Numpy ☆86 · Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it ☆128 · Updated last year
- Pytorch Implementation of MuZero ☆353 · Updated last year
- A python implementation of tabular MuZero for educational purposes ☆21 · Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆158 · Updated 4 years ago
- Map-Elites based on Evolution Strategies ☆31 · Updated 3 years ago
- 🌳 Python implementation of single-player Monte-Carlo Tree Search. ☆63 · Updated 4 years ago
- Reinforcement Learning Assembly ☆92 · Updated 3 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning" ☆46 · Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 · Updated 6 years ago
- fast + parallel AlphaZero in JAX ☆97 · Updated 6 months ago
- A bare-bones Python library for quality diversity optimization. ☆222 · Updated this week
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks ☆50 · Updated 2 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆44 · Updated 2 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI. ☆79 · Updated last year
- ☆304 · Updated 6 months ago
- ReconChess python implementation ☆42 · Updated 3 years ago
- Python implementation of the genetic algorithm MAP-Elites with applications in constrained optimization ☆53 · Updated 4 years ago
- Keeping track of RL experiments ☆161 · Updated 2 years ago
- Single player Alpha Zero implementation ☆42 · Updated 3 years ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants) ☆34 · Updated last year
- ☆182 · Updated 11 months ago
- ☆51 · Updated 2 years ago
- A leaderboard of human and machine performance on the Arcade Learning Environment (ALE). ☆21 · Updated 6 years ago