google-deepmind / diplomacyLinks
☆61Updated last year
Alternatives and similar repositories for diplomacy
Users that are interested in diplomacy are comparing it to the libraries listed below
Sorting:
- Interpreting how transformers simulate agents performing RL tasks☆90Updated 2 years ago
- ☆47Updated last year
- Learning diverse options through the Laplacian representation.☆23Updated 2 years ago
- CookingZoo: a gym-cooking derivative to simulate a complex cooking environment☆21Updated last year
- Scalable Opponent Shaping Experiments in JAX☆25Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆76Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆109Updated last year
- A categorised list of Multi-Agent Reinforcemnt Learning (MARL) papers☆56Updated 3 years ago
- Baselines for gymnax 🤖☆74Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆122Updated last year
- ☆37Updated 2 years ago
- Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.☆138Updated last year
- Efficient baselines for autocurricula in JAX.☆206Updated last year
- Supervised and RL Models for No Press Diplomacy☆74Updated 2 months ago
- Baselines for Neural MMO -- new users should treat this repo as a starter project☆51Updated last year
- [NeurIPS 2022] Open source code for reusing prior computational work in RL.☆100Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆49Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆106Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Updated 2 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆31Updated 5 months ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Updated last year
- ☆57Updated last year
- A collection of matrix games in JAX☆13Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Updated 2 years ago
- Object Centric Atari games☆99Updated 2 months ago
- Repo to reproduce the First-Explore paper results☆39Updated last year
- Contains JAX implementation of algorithms for inverse reinforcement learning☆74Updated last year
- Code and links for over 25,000 trained Atari agents☆98Updated last year
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles☆90Updated 3 weeks ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆48Updated 3 years ago