facebookresearch / diplomacy_searchbot
Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021
☆44Updated 2 years ago
Related projects: ⓘ
- The code used to power DeepRole☆35Updated last year
- Supervised and RL Models for No Press Diplomacy☆59Updated last year
- ☆45Updated 4 months ago
- Code for magnetic mirror descent.☆13Updated 11 months ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Scalable Implementation of Neural Fictitous Self-Play☆71Updated 5 years ago
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆108Updated last month
- impact-driven-exploration☆125Updated 11 months ago
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆29Updated last year
- ☆21Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆102Updated 3 weeks ago
- Nethack Learning Environment Wrapper for Language Interface☆33Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- ☆85Updated 3 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆140Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- A Python Toolkit for Managing a Large Number of Experiments☆30Updated 7 months ago
- Scaling scaling laws with board games.☆36Updated last year
- PyTorch code to train and evaluate Procgen tasks☆23Updated 3 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 4 years ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 6 years ago
- MultiTask Environments for Reinforcement Learning.☆74Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆95Updated 2 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆44Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆16Updated last year
- ☆282Updated last year
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆101Updated last year
- ReconChess python implementation☆42Updated 2 years ago