Quuxplusone / Hanabi
Framework for writing bots that play Hanabi.
☆36 Updated 6 years ago
Alternatives and similar repositories for Hanabi
Users who are interested in Hanabi are comparing it to the repositories listed below
- State-of-the-art Hanabi bots + simulation framework in Rust ☆45 Updated last year
- Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021 ☆47 Updated 2 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it ☆128 Updated 2 years ago
- StarCraft: BroodWars OpenAI Gym environment ☆83 Updated 6 years ago
- Some common TD Learning algorithms ☆66 Updated 5 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 Updated 6 years ago
- Code for 'The Grand Atari Challenge dataset' paper ☆53 Updated 7 years ago
- Code repository for "On the interaction between supervision and self-play in emergent communication" (ICLR 2020) ☆15 Updated 5 years ago
- impact-driven-exploration ☆131 Updated last year
- Code release for Learning with Opponent-Learning Awareness and variations. ☆149 Updated 2 years ago
- Scalable Implementation of Neural Fictitious Self-Play ☆83 Updated 6 years ago
- Reinforcement Learning Assembly ☆92 Updated 3 years ago
- ☆65 Updated last year
- OpenAI Retro Contest ☆65 Updated 2 years ago
- ☆43 Updated 8 years ago
- This package allows PLE to be used as a gym environment. ☆72 Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees ☆93 Updated 5 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks ☆50 Updated 2 years ago
- ☆44 Updated 6 years ago
- ☆84 Updated 4 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning ☆101 Updated 3 years ago
- A StarCraft 2 agent for harvesting resources ☆13 Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI. ☆79 Updated last year
- Some baselines for Pommerman competition ☆46 Updated 7 years ago
- A framework for experimenting with never-ending learning ☆79 Updated 9 months ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm ☆25 Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆63 Updated last year
- ☆25 Updated 3 years ago
- ☆67 Updated 3 years ago
- A3C style Option-Critic with deliberation cost ☆39 Updated 7 years ago