hayoung-kim / mcts-tic-tac-toe
Monte Carlo Tree Search for tic tac toe
☆36Updated 6 years ago
Alternatives and similar repositories for mcts-tic-tac-toe:
Users that are interested in mcts-tic-tac-toe are comparing it to the libraries listed below
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆211Updated 2 years ago
- A structured implementation of MuZero☆204Updated 2 years ago
- Pytorch Implementation of MuZero☆351Updated last year
- A simple implementation of MuZero algorithm for connect4 game☆97Updated 4 years ago
- This project is implementation code of AlphaStar☆199Updated last year
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆81Updated 5 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆157Updated 4 years ago
- An environment of the board game Go using OpenAI's Gym API☆173Updated 2 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆115Updated 9 months ago
- ☆39Updated 4 years ago
- ☆67Updated 3 years ago
- Series of deep reinforcement learning algorithms 🤖☆29Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆118Updated 3 years ago
- ☆299Updated 4 months ago
- A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartP…☆109Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆198Updated 2 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆77Updated 6 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 9 months ago
- Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.☆96Updated 4 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 4 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆366Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations.☆147Updated 2 years ago
- A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)☆246Updated 9 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆49Updated 2 years ago
- Lightweight multi-agent gridworld Gym environment☆204Updated last year
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆311Updated 3 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆21Updated last year
- Demo of UCT (MCTS) in Python / Numpy☆85Updated 2 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆73Updated 4 months ago
- A collection of pre-trained RL agents using Stable Baselines3☆125Updated 5 months ago