evg-tyurin / alpha-nagibator
Implementation of self-play based reinforcement learning for Checkers based on the AlphaGo Zero methods.
☆18 Updated 7 years ago
Alternatives and similar repositories for alpha-nagibator
Users who are interested in alpha-nagibator are comparing it to the repositories listed below.
- A C++ PyTorch implementation of MuZero ☆38 Updated last year
- SpielViz is an interactive viewer for OpenSpiel games. ☆32 Updated last year
- AlphaZero in JAX ☆77 Updated last year
- A fast, generalized, and modified implementation of DeepMind's distinguished AlphaZero in PyTorch. ☆76 Updated 5 months ago
- Train an agent to play VizDoom with multi-sensory inputs. Trained using Sample Factory ☆14 Updated 3 years ago
- This contains the Flax model of min(DALL·E) and code for converting it to PyTorch ☆45 Updated 2 years ago
- PyTorch AlphaZero implementation with multiplayer support [NeurIPS 2019 Deep Reinforcement Learning Workshop] ☆34 Updated 4 years ago
- Connect4 reinforcement learning by AlphaGo Zero methods. ☆113 Updated 4 years ago
- AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Maste… ☆90 Updated 7 years ago
- Evaluating different engineering tricks that make RL work ☆15 Updated 4 years ago
- Reinforcement learning in pure JAX. ☆13 Updated 3 months ago
- A clean implementation based on Expert Iteration for any game, inspired by alpha-zero-general ☆44 Updated 2 years ago
- Applying DeepMind's MuZero algorithm to the cart pole environment in Gym ☆21 Updated 2 years ago
- The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with ☆215 Updated 2 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆32 Updated 6 years ago
- Using self-play, MCTS, and a deep neural network to create a Hearthstone AI player ☆29 Updated 6 years ago
- ☆51 Updated 2 years ago
- An implementation of MuZero in JAX. ☆56 Updated 2 years ago
- ☆26 Updated 2 years ago
- Interactive GAN evolution of Mario and Zelda levels. ☆54 Updated last year
- Gym wrapper for VizDoom environments ☆12 Updated 6 years ago
- Understanding RL vision Distill article ☆23 Updated 2 years ago
- RL environment replicating the werewolf game to study emergent communication ☆19 Updated 2 years ago
- Deep reinforcement learning implementation that trains AIs for the CodeCraft real-time strategy game. ☆21 Updated 2 years ago
- A stateful pytree library for training neural networks. ☆22 Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆45 Updated 3 years ago
- PyTorch implementation of AlphaZero Connect from scratch (with results) ☆81 Updated 5 years ago
- Code for the paper LazImpa: Lazy and Impatient neural agents learn to communicate efficiently. Mathieu Rita, Rahma Chaabouni and Emmanuel… ☆16 Updated 4 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in JAX. ☆31 Updated 10 months ago
- AdaCat ☆49 Updated 2 years ago