HumanCompatibleAI / seals
Benchmark environments for reward modelling and imitation learning algorithms.
☆46 · Updated last year
Alternatives and similar repositories for seals:
Users who are interested in seals are comparing it to the libraries listed below.
- Library to compare and evaluate reward functions ☆66 · Updated last year
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆68 · Updated last year
- PAIRED in PyTorch 🔥 ☆58 · Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆85 · Updated 3 years ago
- PyTorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re… ☆101 · Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 · Updated last year
- Efficient Exploration via State Marginal Matching (2019) ☆67 · Updated 5 years ago
- impact-driven-exploration ☆130 · Updated last year
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020) ☆77 · Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite ☆210 · Updated 9 months ago
- rllab's viskit with some added features ☆73 · Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆83 · Updated 5 years ago
- A collection of RL algorithms written in JAX. ☆95 · Updated 2 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3. ☆12 · Updated 2 years ago
- Convert DeepMind Control Suite to OpenAI gym environments. ☆83 · Updated 5 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆193 · Updated 2 years ago
- Reward Learning by Simulating the Past ☆44 · Updated 5 years ago
- ☆47 · Updated 4 years ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning" ☆76 · Updated 2 years ago
- ☆111 · Updated last year
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆188 · Updated 2 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control. ☆130 · Updated 6 months ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters. ☆23 · Updated 4 years ago
- Revisiting Rainbow ☆74 · Updated 3 years ago
- Benchmarking RL generalization in an interpretable way. ☆142 · Updated last week
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆80 · Updated last year
- Code for "On the Utility of Learning about Humans for Human-AI Coordination" ☆108 · Updated last year
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆109 · Updated 5 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆108 · Updated 3 years ago
- Accompanying code for "Learning and Planning in Average-Reward Markov Decision Processes" ☆14 · Updated 4 years ago