HumanCompatibleAI / adversarial-policies
Find best-response to a fixed policy in multi-agent RL
☆276Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for adversarial-policies
- Tools for accelerating safe exploration research.☆506Updated last year
- A Python interface for reinforcement learning environments☆345Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆533Updated last year
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆456Updated 7 months ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆169Updated last year
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆280Updated last year
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆393Updated last year
- Code for the paper "Phasic Policy Gradient"☆251Updated last year
- Repo for reproduction of sequential social dilemmas☆387Updated 4 months ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆190Updated 2 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆557Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆191Updated 2 years ago
- Keeping track of RL experiments☆159Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆479Updated last year
- Real-World RL Benchmark Suite☆346Updated 4 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆513Updated 3 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆302Updated 4 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆608Updated 4 years ago
- Multi Agent Reinforcement Learning using MalmÖ☆246Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆294Updated 2 years ago
- Code for conservative Q-learning☆409Updated 2 years ago
- A collection of multi agent environments based on OpenAI gym.☆570Updated 4 months ago
- PyTorch implementation of SAC-Discrete.☆284Updated 3 months ago
- The RL Reliability Metrics library provides a set of metrics for measuring the reliability of reinforcement learning (RL) algorithms, as …☆162Updated last year
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit…☆186Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆472Updated last year
- Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…☆213Updated 4 years ago
- Vectorized interface for reinforcement learning environments☆141Updated last year
- List of competitions related to Reinforcement Learning☆346Updated 10 months ago