Quuxplusone / Hanabi
Framework for writing bots that play Hanabi.
☆36Updated 5 years ago
Alternatives and similar repositories for Hanabi:
Users who are interested in Hanabi are comparing it to the libraries listed below:
- State-of-the-art Hanabi bots + simulation framework in Rust☆44Updated last year
- A simulator for strategies of a well-known cooperative card game.☆15Updated 9 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- Scalable Implementation of Neural Fictitious Self-Play☆76Updated 6 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆46Updated 2 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆52Updated 7 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆115Updated 8 months ago
- A Python implementation of tabular MuZero for educational purposes☆21Updated 5 years ago
- 🃏♠️♥️♦️♣️☆26Updated 4 years ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- ☆13Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆100Updated 2 years ago
- The code used to power DeepRole☆35Updated 2 years ago
- ☆67Updated 3 years ago
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆116Updated 3 years ago
- A simple implementation of the MuZero algorithm for the connect4 game☆97Updated 4 years ago
- impact-driven-exploration☆130Updated last year
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆94Updated 6 years ago
- This repository contains levels for boxoban, a box-pushing puzzle game inspired by Sokoban.☆68Updated 2 years ago
- Source code of the MaastCTS2 agent for General Video Game playing. Champion of the 2016 GVG-AI Single-Player Track, and runner-up of the …☆14Updated 3 years ago
- 2019 talk at GECCO☆68Updated 5 years ago
- Scaling scaling laws with board games.☆48Updated last year
- Code for magnetic mirror descent.☆16Updated last year
- AlphaZero in JAX☆77Updated last year
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) in JAX.☆30Updated 7 months ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Updated 6 years ago
- A structured implementation of MuZero☆207Updated 2 years ago
- ☆22Updated this week
- Library for running a Monte Carlo tree search, either traditionally or with expert policies☆122Updated 11 months ago