evg-tyurin / alpha-nagibator
Implementation of self-play based reinforcement learning for Checkers based on the AlphaGo Zero methods.
☆18 Updated 6 years ago
Alternatives and similar repositories for alpha-nagibator:
Users who are interested in alpha-nagibator are comparing it to the libraries listed below:
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 Updated 3 years ago
- Train an agent to play VizDoom with multisensory inputs. Trained using Sample Factory. ☆14 Updated 3 years ago
- An implementation of Phasic Policy Gradient, a proposed improvement of Proximal Policy Optimization, in PyTorch ☆51 Updated 2 weeks ago
- Source code for "A deep dive into reinforcement learning" ☆12 Updated 5 years ago
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆69 Updated last month
- AlphaZero in JAX ☆73 Updated 9 months ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆35 Updated 3 years ago
- Understanding RL vision Distill article ☆23 Updated last year
- Research Into Learning to Generate Game Levels through Play ☆31 Updated 4 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆20 Updated last year
- Evaluating different engineering tricks that make RL work ☆15 Updated 3 years ago
- Toribash Learning Environment ☆49 Updated last year
- ☆66 Updated 3 years ago
- SpielViz is an interactive viewer for OpenSpiel games. ☆28 Updated 8 months ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- Library for running a Monte Carlo tree search, either traditionally or with expert policies ☆120 Updated 8 months ago
- The source code for the gym-microrts paper. ☆42 Updated 2 years ago
- ☆28 Updated 2 years ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm… ☆39 Updated 2 years ago
- Gym wrapper for pysc2 ☆10 Updated 2 years ago
- ☆13 Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch ☆31 Updated last week
- Gym wrapper for Vizdoom environments ☆12 Updated 6 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) in JAX. ☆29 Updated 5 months ago
- A C++ PyTorch implementation of MuZero ☆34 Updated 8 months ago
- My Simple Implementation of AlphaGo Zero on Connect4 ☆18 Updated 6 years ago
- A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each othe… ☆156 Updated 3 years ago
- Neuro-evolution for OpenAI Gym environments ☆56 Updated 3 years ago
- ☆48 Updated last year
- Framework for inspecting actions and observations in StarCraft II replays ☆20 Updated 6 years ago