facebookresearch / diplomacy_searchbotLinks
Code to accompany "Human-Level Performance in No-Press Diplomacy via Equilibrium Search", published at ICLR 2021
☆47Updated 2 years ago
Alternatives and similar repositories for diplomacy_searchbot
Users that are interested in diplomacy_searchbot are comparing it to the libraries listed below
Sorting:
- Supervised and RL Models for No Press Diplomacy☆66Updated 2 years ago
- Code for magnetic mirror descent.☆16Updated last year
- Multi-Agent RL Environment for the Stratego Board Game (and variants)☆34Updated last year
- Scalable implementation of DREAM - Deep RL for multi-agent imperfect information games☆117Updated 10 months ago
- Neural Fictitious Self-Play in Leduc Holdem☆10Updated 6 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆148Updated 2 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆101Updated 2 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆95Updated 6 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆86Updated 3 years ago
- The source code for the gym-microrts paper.☆42Updated 2 years ago
- impact-driven-exploration☆131Updated last year
- Scalable Implementation of Neural Fictitous Self-Play☆80Updated 6 years ago
- PAIRED in PyTorch 🔥☆60Updated 2 years ago
- ☆301Updated 5 months ago
- ☆65Updated last year
- ☆55Updated last year
- Results reproductions & comparisons between OpenSpiel implementations, associated paper & originating works☆16Updated 4 years ago
- Code for 'The Grand Atari Challenge dataset' paper☆53Updated 7 years ago
- The code used to power DeepRole☆36Updated 2 years ago
- Library to compare and evaluate reward functions☆67Updated last year
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆161Updated 3 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆37Updated 5 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆128Updated last year
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆113Updated 9 months ago
- General Modules for JAX☆66Updated 2 months ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆39Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year