HumanCompatibleAI / seals
Benchmark environments for reward modelling and imitation learning algorithms.
☆44 Updated last year
Related projects
Alternatives and complementary repositories for seals
- A tool for recording RL trajectories. ☆94 Updated last week
- A collection of RL algorithms written in JAX. ☆95 Updated 2 years ago
- Baselines for gymnax 🤖 ☆60 Updated last year
- Library to compare and evaluate reward functions ☆61 Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆105 Updated 2 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆67 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆99 Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020) ☆75 Updated 11 months ago
- Evaluating long-term memory of reinforcement learning algorithms ☆133 Updated last year
- Object Centric Atari games ☆48 Updated this week
- PAIRED in PyTorch 🔥 ☆56 Updated last year
- Datasets for data-driven deep reinforcement learning with PyBullet environments ☆143 Updated 3 years ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning" ☆76 Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆79 Updated last year
- Revisiting Rainbow ☆73 Updated 3 years ago
- Recurrent continuous reinforcement learning algorithms implemented in Pytorch. ☆50 Updated 3 years ago
- impact-driven-exploration ☆128 Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite ☆205 Updated 6 months ago
- rllab's viskit with some added features ☆73 Updated last year
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019 ☆78 Updated 5 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 6 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆105 Updated 2 months ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆78 Updated 2 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm) ☆163 Updated 2 years ago
- Efficient Exploration via State Marginal Matching (2019) ☆66 Updated 5 years ago
- Simple maze environments using mujoco-py ☆52 Updated 10 months ago
- ☆110 Updated last year
- Benchmarking RL generalization in an interpretable way. ☆132 Updated 9 months ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 Updated last year