Quuxplusone / HanabiLinks
Framework for writing bots that play Hanabi.
☆37Updated 6 years ago
Alternatives and similar repositories for Hanabi
Users that are interested in Hanabi are comparing it to the libraries listed below
Sorting:
- A simulator for strategies of a well-known cooperative card game.☆15Updated 9 years ago
- State of the art Hanabi bots + simulation framework in rust☆45Updated last year
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated 2 years ago
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021☆48Updated 3 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆53Updated 7 years ago
- ☆13Updated 3 years ago
- Reinforcement Learning Assembly☆92Updated 4 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆149Updated 2 years ago
- The code used to power DeepRole☆36Updated 2 years ago
- Some hard problems for reinforcement learning.☆31Updated 6 years ago
- This package allows to use PLE as a gym environment.☆72Updated 5 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆83Updated 6 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆118Updated last year
- An environment for benchmarking commonsense agents☆29Updated 5 years ago
- A framework for experimenting with never-ending learning☆79Updated 10 months ago
- impact-driven-exploration☆132Updated last year
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆63Updated last year
- A novel parallel UCT algorithm with linear speedup and negligible performance loss.☆120Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Updated 3 years ago
- ☆35Updated 7 years ago
- AlphaZero in JAX☆78Updated last year
- 2019 talk at GECCO☆68Updated 6 years ago
- ☆67Updated 3 years ago
- Scaling scaling laws with board games.☆53Updated 2 years ago
- ☆84Updated 4 years ago
- ☆106Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago