Quuxplusone / Hanabi
Framework for writing bots that play Hanabi.
☆36, updated 5 years ago
Related projects
Alternatives and complementary repositories for Hanabi
- State-of-the-art Hanabi bots + simulation framework in Rust (☆43, updated 11 months ago)
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021 (☆45, updated 2 years ago)
- Scaling scaling laws with board games. (☆40, updated last year)
- Scalable Implementation of Neural Fictitious Self-Play (☆73, updated 5 years ago)
- Code for 'The Grand Atari Challenge dataset' paper (☆52, updated 7 years ago)
- impact-driven-exploration (☆126, updated last year)
- SpielViz is an interactive viewer for OpenSpiel games. (☆28, updated 5 months ago)
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning (☆96, updated 2 years ago)
- A job launching library for docker, EC2, GCP, etc. (☆57, updated 3 years ago)
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it (☆127, updated last year)
- ReconChess Python implementation (☆42, updated 2 years ago)
- Some common TD Learning algorithms (☆67, updated 4 years ago)
- Multi-Agent RL Environment for the Stratego Board Game (and variants) (☆29, updated last year)
- General Modules for JAX (☆58, updated 3 months ago)
- The code used to power DeepRole (☆35, updated last year)
- Benchmark environments for reward modelling and imitation learning algorithms. (☆44, updated last year)
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". (☆61, updated last year)
- Reinforcement Learning Assembly (☆92, updated 3 years ago)
- SafeLife: safety benchmarks for reinforcement learning agents (☆59, updated 3 years ago)
- A rewrite of hanabi-bot in Scala (☆16, updated 3 years ago)
- A JAX Implementation of the Twin Delayed DDPG Algorithm (☆31, updated 4 years ago)
- Source code for OpenAI Retro Contest for Sonic the Hedgehog (☆30, updated 6 years ago)
- Code to reproduce the experiments in "The Mirage of Action-Dependent Baselines in Reinforcement Learning". (☆17, updated 6 years ago)
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. (☆78, updated last year)
- Code repository for "On the interaction between supervision and self-play in emergent communication" (ICLR 2020) (☆16, updated 4 years ago)