jseppanen / azalea
Hex board game AI with self-play learning based on the AlphaZero algorithm
☆34Updated 5 years ago
Alternatives and similar repositories for azalea:
Users that are interested in azalea are comparing it to the libraries listed below
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆81Updated 2 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- Scaling scaling laws with board games.☆48Updated last year
- Code for 'The Grand Atari Challenge dataset' paper☆52Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- ☆35Updated 6 years ago
- This is my implementation of the Optimality Tightening☆37Updated 7 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Updated 5 years ago
- Skip Context Tree Switching - Reference Implementation☆49Updated 7 years ago
- General Modules for JAX☆64Updated 2 weeks ago
- Code and links for over 25,000 trained Atari agents☆94Updated 6 months ago
- An implementation of MuZero in JAX.☆56Updated 2 years ago
- ☆50Updated last year
- Source code of the MaastCTS2 agent for General Video Game playing. Champion of the 2016 GVG-AI Single-Player Track, and runner-up of the …☆14Updated 3 years ago
- Coin collector game in Microsoft TextWorld, and a simple RL agent solving it.☆36Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆95Updated 6 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆53Updated 2 years ago
- Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).☆14Updated 3 years ago
- ☆44Updated 6 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Updated 6 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 5 years ago
- AlphaZero implemented for Hex☆23Updated 6 years ago
- ☆73Updated 4 months ago
- A PyTorch implementation of DeepMind's MCTSnet☆18Updated 2 years ago
- A platform of grid world that supports up to 1 million reinforcement-learning agents.☆69Updated 7 years ago
- impact-driven-exploration☆130Updated last year
- Agent to play the game Hex, based on the Expert Iteration from the paper Thinking Fast and Slow with Deep Learning and Tree Search (NIPS …☆7Updated 6 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆146Updated last year
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago