suragnair / alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
☆4,101 Updated 3 months ago
Alternatives and similar repositories for alpha-zero-general:
Users who are interested in alpha-zero-general are comparing it to the libraries listed below.
- ☆13 Updated 4 months ago
- Chess reinforcement learning by AlphaGo Zero methods. ☆2,161 Updated 2 years ago
- MuZero ☆2,615 Updated 7 months ago
- Reversi reinforcement learning by AlphaGo Zero methods. ☆678 Updated 2 years ago
- OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games. ☆4,482 Updated last week
- A fork of OpenAI Baselines, implementations of reinforcement learning algorithms ☆4,252 Updated 2 years ago
- An open-source implementation of the AlphaGoZero algorithm ☆3,493 Updated 4 years ago
- A replica of the AlphaZero methodology for deep reinforcement learning in Python ☆2,033 Updated 2 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch ☆3,289 Updated last year
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games. ☆670 Updated last year
- Reinforcement Learning in PyTorch ☆2,250 Updated 4 years ago
- A Platform for Many-Agent Reinforcement Learning ☆1,720 Updated 2 years ago
- ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation ☆3,388 Updated 5 years ago
- Monte Carlo tree search in Python ☆599 Updated 2 years ago
- Tensorforce: a TensorFlow library for applied reinforcement learning ☆3,310 Updated 8 months ago
- Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo. ☆4,036 Updated this week
- TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning. ☆2,895 Updated last month
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms ☆16,210 Updated 8 months ago
- An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row) ☆3,458 Updated 11 months ago
- Replicating AlphaGo's architecture in a readable manner ☆1,157 Updated 5 years ago
- An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities ☆2,875 Updated this week
- PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL ☆3,088 Updated 3 years ago
- A library of reinforcement learning components and agents ☆3,640 Updated 3 months ago
- An End-To-End, Lightweight and Flexible Platform for Game Research ☆2,089 Updated 3 years ago
- Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillio… ☆3,343 Updated 10 months ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆211 Updated 2 years ago
- Code for the paper "Exploration by Random Network Distillation" ☆894 Updated 4 years ago
- A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included. ☆1,166 Updated 2 years ago
- Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow ☆1,940 Updated this week
- A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.) ☆3,610 Updated last month