plkmo / AlphaZero_Connect4

PyTorch implementation of AlphaZero Connect4 from scratch (with results)

☆84

Alternatives and similar repositories for AlphaZero_Connect4

Users who are interested in AlphaZero_Connect4 compare it to the repositories listed below.

huangeddie / GymGo
An environment of the board game Go using OpenAI's Gym API
☆175 Updated 3 years ago
blanyal / alpha-zero
AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste…
☆90 Updated 7 years ago
johan-gras / MuZero
A structured implementation of MuZero
☆205 Updated 3 years ago
koulanurag / muzero-pytorch
Pytorch Implementation of MuZero
☆354 Updated 2 years ago
JoshVarty / AlphaZeroSimple
The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with
☆218 Updated 2 years ago
genyrosk / gym-chess
A simple chess environment for openai/gym
☆161 Updated last year
kaesve / muzero
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…
☆160 Updated 4 years ago
YuriCat / MuZeroJupyterExample
☆67 Updated 3 years ago
uber-research / go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
☆574 Updated 2 years ago
Zeta36 / connect4-alpha-zero
Connect4 reinforcement learning by AlphaGo Zero methods.
☆113 Updated 4 years ago
Zeta36 / muzero
A simple implementation of MuZero algorithm for connect4 game
☆96 Updated 4 years ago
int8 / monte-carlo-tree-search
Monte carlo tree search in python
☆610 Updated 3 years ago
google-deepmind / dqn_zoo
DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…
☆476 Updated last year
bhansconnect / fast-alphazero-general
A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general
☆44 Updated 2 years ago
mrahtz / learning-from-human-preferences
Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"
☆325 Updated 3 years ago
grantsrb / Gym-Snake
An OpenAI gym environment made for RL
☆70 Updated last year
mpSchrader / gym-sokoban
Sokoban environment for OpenAI Gym
☆377 Updated last year
Shmuma / ptan
PyTorch Agent Net: reinforcement learning toolkit for pytorch
☆546 Updated 9 months ago
hardmaru / WorldModelsExperiments
World Models Experiments
☆652 Updated 2 years ago
facebookresearch / torchbeast
A PyTorch Platform for Distributed RL
☆749 Updated 3 years ago
cyoon1729 / RLcycle
A library for ready-made reinforcement learning agents and reusable components for neat prototyping
☆301 Updated last year
hardmaru / slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
☆759 Updated last year
MattChanTK / gym-maze
A basic 2D maze environment where an agent starts from the top left corner and tries to find its way to the bottom left corner.
☆371 Updated last year
Bam4d / Griddly
A grid-world game engine for game AI research
☆246 Updated last year
rgilman33 / simple-A2C-PPO
Actor-critic trained w PPO on OpenAI's Procgen Benchmark (PyTorch). Built from scratch.
☆102 Updated 5 years ago
google-deepmind / hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
☆655 Updated 2 years ago
hrpan / tetris_mcts
MCTS project for Tetris
☆348 Updated 9 months ago
lilianweng / deep-reinforcement-learning-gym
Deep reinforcement learning model implementation in Tensorflow + OpenAI gym
☆299 Updated 2 years ago
brilee / python_uct
Demo of UCT (MCTS) in Python / Numpy
☆88 Updated 2 years ago
david-abel / simple_rl
A simple framework for experimenting with Reinforcement Learning in Python.
☆317 Updated last year