plkmo / AlphaZero_Connect4
PyTorch implementation of AlphaZero Connect from scratch (with results)
☆82 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for AlphaZero_Connect4
- An environment of the board game Go using OpenAI's Gym API ☆168 · Updated 2 years ago
- A structured implementation of MuZero ☆206 · Updated 2 years ago
- PyTorch Implementation of MuZero ☆343 · Updated last year
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆190 · Updated last year
- A simple implementation of MuZero algorithm for connect4 game ☆95 · Updated 4 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆156 · Updated 3 years ago
- ☆65 · Updated 3 years ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general ☆42 · Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games ☆112 · Updated 3 months ago
- A simple chess environment for openai/gym ☆151 · Updated 8 months ago
- An OpenAI gym environment made for RL ☆65 · Updated 11 months ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆114 · Updated 3 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions ☆146 · Updated 3 years ago
- Monte Carlo Tree Search for tic tac toe ☆34 · Updated 6 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆65 · Updated last year
- The submission template for the MineRL Competition @ NeurIPS 2021. Clone this to make a new submission! ☆92 · Updated 3 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (… ☆457 · Updated 7 months ago
- A grid-world game engine for game AI research ☆233 · Updated 7 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆191 · Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss. ☆112 · Updated 3 years ago
- The project is a platform of zero learning with a library of games. ☆266 · Updated 3 years ago
- ☆102 · Updated 4 years ago
- OpenAI Gym wrapper for ViZDoom environments ☆66 · Updated 3 years ago
- ☆286 · Updated last year
- Learning from zero (mostly based off of AlphaZero) in General Game Playing. ☆81 · Updated last year
- OpenAI Gym environment for the game 2048 ☆71 · Updated 2 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆73 · Updated 5 years ago
- A PyTorch Library for Reinforcement Learning Research ☆196 · Updated last year
- A PyTorch Platform for Distributed RL ☆738 · Updated 3 years ago
- Demo of UCT (MCTS) in Python / Numpy ☆83 · Updated last year