HumanCompatibleAI / adversarial-policiesLinks
Find best-response to a fixed policy in multi-agent RL
☆288Updated 3 years ago
Alternatives and similar repositories for adversarial-policies
Users that are interested in adversarial-policies are comparing it to the libraries listed below
Sorting:
- Tools for accelerating safe exploration research.☆555Updated 2 years ago
- Keeping track of RL experiments☆163Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆205Updated 3 years ago
- Real-World RL Benchmark Suite☆356Updated 5 years ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆194Updated 2 years ago
- A Python interface for reinforcement learning environments☆377Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆265Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data☆414Updated 4 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"☆176Updated 2 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆479Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination☆554Updated 4 years ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆441Updated 2 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆318Updated 2 years ago
- A PyTorch library for building deep reinforcement learning agents.☆652Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆555Updated 2 years ago
- Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms☆302Updated 2 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆501Updated 2 years ago
- Gridworld for MARL experiments☆142Updated 4 years ago
- Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.☆84Updated 4 years ago
- A basic 2D maze environment where an agent start from the top left corner and try to find its way to the bottom left corner.☆372Updated 2 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆325Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆262Updated 5 years ago
- Repo for reproduction of sequential social dilemmas☆406Updated 7 months ago
- The RL Reliability Metrics library provides a set of metrics for measuring the reliability of reinforcement learning (RL) algorithms, as …☆164Updated 2 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆371Updated 3 years ago
- Multitask Environments for RL☆280Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆510Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆665Updated 5 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆593Updated 4 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆575Updated 2 years ago