facebookresearch / diplomacy_searchbot
Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021
☆45Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for diplomacy_searchbot
- Code for magnetic mirror descent.☆15Updated last year
- Supervised and RL Models for No Press Diplomacy☆60Updated last year
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆30Updated last year
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- impact-driven-exploration☆128Updated last year
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆97Updated 2 years ago
- The code used to power DeepRole☆35Updated 2 years ago
- Scaling scaling laws with board games.☆43Updated last year
- ☆85Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆73Updated 5 years ago
- ☆48Updated 7 months ago
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆112Updated 4 months ago
- Framework for writing bots that play Hanabi.☆36Updated 5 years ago
- Nethack Learning Environment Wrapper for Language Interface☆34Updated last year
- Benchmark environments for reward modelling and imitation learning algorithms.☆44Updated last year
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- ReconChess python implementation☆42Updated 2 years ago
- ☆66Updated 8 months ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆17Updated 2 years ago
- ☆21Updated 2 years ago
- Code for "Unsupervised State Representation Learning in Atari"☆242Updated last year
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- cfrx is a collection of algorithms and tools for hardware-accelerated Counterfactual Regret Minimization (CFR) algorithms in Jax.☆27Updated 3 months ago
- Reinforcement Learning Assembly☆92Updated 3 years ago
- MultiTask Environments for Reinforcement Learning.☆74Updated 2 years ago