evg-tyurin / alpha-nagibator
Implementation of self-play based reinforcement learning for Checkers based on the AlphaGo Zero methods.
☆18Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for alpha-nagibator
- Connect4 reinforcement learning by AlphaGo Zero methods.☆114Updated 3 years ago
- A fast, generalized, and modified implementation of Deepmind's distinguished AlphaZero in PyTorch.☆66Updated last year
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Gradients, in Pytorch☆46Updated this week
- An environment of the board game Go using OpenAI's Gym API☆168Updated 2 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 4 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop]☆34Updated 3 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆27Updated 3 months ago
- SpielViz is an interactive viewer for OpenSpiel games.☆28Updated 6 months ago
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆42Updated last year
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Updated 3 years ago
- A simple implementation of MuZero algorithm for connect4 game☆95Updated 4 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game.☆20Updated last year
- Python Implementation of Parameter-exploring Policy Gradients Evolution Strategy☆15Updated 4 years ago
- ☆20Updated 5 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆31Updated 6 years ago
- ☆28Updated 2 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- ☆21Updated 4 years ago
- Model-Based RL Demo for Pendulum-v0☆13Updated 4 years ago
- AlphaZero in JAX☆69Updated 7 months ago
- ☆17Updated last year
- My Simple Implementation of AlphaGo Zero on Connect4☆18Updated 6 years ago
- A leaderboard of human and machine performance on the Arcade Learning Environment (ALE).☆22Updated 6 years ago
- Neuro-evolution for OpenAI Gym environments☆56Updated 3 years ago
- Old and new Reinforcement Learning algorithms run on the GridUniverse ecosystem☆22Updated 5 years ago
- StarCraft II Reinforcement Learning with Pytorch - Mini Games☆23Updated 6 years ago
- Interactive GAN evolution of Mario and Zelda levels.☆54Updated last year
- ☆65Updated 3 years ago
- Gym wrapper for Vizdoom environments☆12Updated 5 years ago