forestagostinelli / DeepCubeALinks
Code for DeepCubeA, a Deep Reinforcement Learning algorithm that can learn to solve the Rubik's cube.
☆181Updated 7 months ago
Alternatives and similar repositories for DeepCubeA
Users that are interested in DeepCubeA are comparing it to the libraries listed below
Sorting:
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with☆217Updated 2 years ago
- A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games☆143Updated 8 months ago
- PyTorch implementation of AlphaZero Connect from scratch (with results)☆83Updated 5 years ago
- An environment of the board game Go using OpenAI's Gym API☆175Updated 3 years ago
- ☆222Updated last year
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe…☆159Updated 4 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆78Updated 7 months ago
- ♟️ Vectorized RL game environments in JAX☆498Updated 4 months ago
- AlphaZero in JAX☆78Updated last year
- A structured implementation of MuZero☆204Updated 3 years ago
- fast + parallel AlphaZero in JAX☆97Updated 6 months ago
- MiniZero: An AlphaZero and MuZero Training Framework☆94Updated 4 months ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆44Updated 2 years ago
- A simple chess environment for openai/gym☆161Updated last year
- Code for 1st place solution to Kaggle's Abstraction and Reasoning Challenge☆157Updated last week
- ☆374Updated 3 years ago
- Pytorch Implementation of MuZero☆353Updated last year
- PyTorch implementation of AlphaZero Chess from scratch☆165Updated 11 months ago
- A simple implementation of MuZero algorithm for connect4 game☆96Updated 4 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆83Updated 2 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆119Updated 4 years ago
- An OpenAI Gym environment for the Flappy Bird game☆126Updated 3 years ago
- Solving the Rubik's cube with deep reinforcement learning and Monte Carlo tree search☆103Updated 6 years ago
- ☆86Updated 6 months ago
- Domain Specific Language for the Abstraction and Reasoning Corpus☆271Updated 9 months ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆117Updated 11 months ago
- Classic MCTS example with mctx☆18Updated 2 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆126Updated last year
- For code etc relating to the network training process.☆162Updated last year
- ☆13Updated 6 months ago