suragnair / alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
☆4,146 Updated 4 months ago
Alternatives and similar repositories for alpha-zero-general
Users that are interested in alpha-zero-general are comparing it to the libraries listed below
- Chess reinforcement learning by AlphaGo Zero methods. ☆2,166 Updated 2 years ago
- MuZero ☆2,648 Updated 8 months ago
- A replica of the AlphaZero methodology for deep reinforcement learning in Python ☆2,033 Updated 2 years ago
- An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row) ☆3,489 Updated last year
- Reversi reinforcement learning by AlphaGo Zero methods. ☆678 Updated 2 years ago
- ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation ☆3,395 Updated 5 years ago
- An open-source implementation of the AlphaGoZero algorithm ☆3,492 Updated 4 years ago
- OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games. ☆4,543 Updated this week
- Monte carlo tree search in python ☆601 Updated 2 years ago
- Tensorforce: a TensorFlow library for applied reinforcement learning ☆3,312 Updated 10 months ago
- A fork of OpenAI Baselines, implementations of reinforcement learning algorithms ☆4,275 Updated 2 years ago
- OpenAI Baselines: high-quality implementations of reinforcement learning algorithms ☆16,294 Updated 9 months ago
- Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL ☆3,101 Updated 3 years ago
- TensorFlow Reinforcement Learning ☆3,137 Updated 2 years ago
- bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent ☆1,522 Updated last year
- TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning. ☆2,907 Updated last month
- An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku ☆209 Updated 3 months ago
- An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games. ☆674 Updated last year
- A Platform for Many-Agent Reinforcement Learning ☆1,734 Updated 2 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch ☆3,309 Updated last year
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor… ☆3,770 Updated 3 years ago
- A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included. ☆1,177 Updated 2 years ago
- Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms ☆2,348 Updated 2 years ago
- Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper. ☆5,461 Updated last year
- Simple and easily configurable grid world environments for reinforcement learning ☆2,243 Updated 3 months ago
- Replicating AlphaGo's architecture in a readable manner ☆1,158 Updated 5 years ago
- A student implementation of Alpha Go Zero ☆280 Updated 6 years ago
- Deep Reinforcement Learning for Keras. ☆5,553 Updated last year
- Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillio… ☆3,378 Updated last year
- An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES ☆762 Updated last year