jseppanen / azalea
Hex board game AI with self-play learning based on the AlphaZero algorithm
☆31Updated 4 years ago
Related projects:
- ☆13Updated 2 years ago
- Learning from zero (mostly based off of AlphaZero) in General Game Playing.☆82Updated last year
- A Python implementation of tabular MuZero for educational purposes☆21Updated 4 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆44Updated 2 years ago
- Source code of the MaastCTS2 agent for General Video Game playing. Champion of the 2016 GVG-AI Single-Player Track, and runner-up of the …☆14Updated 2 years ago
- SeqGAN but with more bells and whistles☆24Updated 6 years ago
- krazy grid world☆25Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- A clean implementation based on Expert Iterations for any game, inspired by alpha-zero-general☆40Updated last year
- AlphaZero implemented for Hex☆22Updated 6 years ago
- Code repository for "On the interaction between supervision and self-play in emergent communication" (ICLR 2020)☆16Updated 4 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated last year
- Agent to play the game Hex, based on the Expert Iteration from the paper Thinking Fast and Slow with Deep Learning and Tree Search (NIPS …☆7Updated 6 years ago
- ☆44Updated 5 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Updated 6 years ago
- Scaling scaling laws with board games.☆36Updated last year
- Code and links for over 25,000 trained Atari agents☆92Updated 3 weeks ago
- PyTorch implementation of the paper 'Compositional languages emerge in a neural iterated learning model' (ICLR 2020).☆13Updated 2 years ago
- PyTorch code to train and evaluate Procgen tasks☆23Updated 3 years ago
- Options of Interest: Temporal Abstraction with Interest Functions (AAAI 2020)☆25Updated 4 years ago
- Learning to play Atari games with reinforcement learning☆10Updated 8 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- Convert sc2 environment to gym-atari and play some mini-games☆21Updated 6 years ago
- Nethack Learning Environment Wrapper for Language Interface☆33Updated last year
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- ☆21Updated 2 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Code for human intervention reinforcement learning☆33Updated 6 years ago
- ☆29Updated 2 years ago