suragnair / alpha-zero-generalLinks
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
☆4,372Updated last year
Alternatives and similar repositories for alpha-zero-general
Users that are interested in alpha-zero-general are comparing it to the libraries listed below
Sorting:
- ☆13Updated last year
- MuZero☆2,766Updated last year
- Chess reinforcement learning by AlphaGo Zero methods.☆2,209Updated 2 years ago
- A replica of the AlphaZero methodology for deep reinforcement learning in Python☆2,034Updated 3 years ago
- An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)☆3,590Updated last year
- Reversi reinforcement learning by AlphaGo Zero methods.☆687Updated 3 years ago
- An open-source implementation of the AlphaGoZero algorithm☆3,531Updated 4 years ago
- Implement AlphaZero/AlphaGo Zero methods on Chinese chess.☆1,203Updated 2 years ago
- OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.☆5,022Updated this week
- The Arcade Learning Environment (ALE) -- a platform for AI research.☆2,392Updated last month
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆230Updated 2 years ago
- Monte carlo tree search in python☆625Updated 3 years ago
- A fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆4,319Updated 3 years ago
- PyTorch implementation of AlphaZero Chess from scratch☆181Updated last year
- ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation☆3,415Updated 6 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…☆93Updated 7 years ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,660Updated 4 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875Updated 3 years ago
- Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th …☆345Updated 3 years ago
- AlphaZero implemented Chinese chess. AlphaGo Zero / AlphaZero实践项目,实现中国象棋。☆522Updated 2 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Updated 4 years ago
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.☆689Updated last year
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku☆218Updated 11 months ago
- Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL☆3,159Updated 4 years ago
- Deep Reinforcement Learning for Keras.☆5,554Updated 2 years ago
- Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning☆2,578Updated 6 years ago
- A library of reinforcement learning components and agents☆3,914Updated last week
- Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.☆5,561Updated last year
- Simple and easily configurable grid world environments for reinforcement learning☆2,398Updated last month
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,471Updated 3 years ago