Benchmark environments for reward modelling and imitation learning algorithms.
☆46 · Sep 19, 2023 · Updated 2 years ago
Alternatives and similar repositories for seals
Users who are interested in seals compare it to the libraries listed below.
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020) ☆78 · Dec 5, 2023 · Updated 2 years ago
- Experiments in applying interpretability techniques to learned reward functions. ☆10 · Dec 11, 2020 · Updated 5 years ago
- Library to compare and evaluate reward functions ☆67 · Oct 23, 2023 · Updated 2 years ago
- Clean PyTorch implementations of imitation and reward learning algorithms ☆1,692 · Jan 7, 2025 · Updated last year
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters. ☆25 · Sep 26, 2020 · Updated 5 years ago
- Understanding RL vision Distill article ☆25 · Mar 3, 2023 · Updated 2 years ago
- ☆11 · Mar 13, 2023 · Updated 2 years ago
- JVRC1 model files for MuJoCo ☆10 · Apr 8, 2025 · Updated 10 months ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 · Dec 13, 2019 · Updated 6 years ago
- Pref-RL provides ready-to-use PbRL agents that are easily extensible. ☆11 · Aug 31, 2022 · Updated 3 years ago
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024). ☆14 · Jan 6, 2025 · Updated last year
- ☆11 · Jun 8, 2020 · Updated 5 years ago
- SynPick dataset generator ☆13 · Jul 8, 2021 · Updated 4 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences" ☆12 · May 20, 2019 · Updated 6 years ago
- Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021). ☆13 · Nov 16, 2021 · Updated 4 years ago
- A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxi… ☆73 · Dec 10, 2020 · Updated 5 years ago
- Evaluating different engineering tricks that make RL work ☆15 · Jun 3, 2021 · Updated 4 years ago
- The implementation of Discriminator Soft Actor Critic ☆15 · Jan 25, 2020 · Updated 6 years ago
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization ☆35 · Oct 22, 2020 · Updated 5 years ago
- Algorithms for Uni-Modal Inverse Reinforcement Learning ☆22 · Sep 23, 2022 · Updated 3 years ago
- GPT implementation in Flax ☆18 · Jan 8, 2022 · Updated 4 years ago
- ☆18 · Mar 28, 2023 · Updated 2 years ago
- ☆19 · Nov 7, 2020 · Updated 5 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings. ☆21 · Mar 5, 2021 · Updated 4 years ago
- Learning to Coordinate Manipulation Skills via Skill Behavior Diversification (ICLR 2020) ☆50 · Jun 22, 2022 · Updated 3 years ago
- Code for paper Causal Confusion in Imitation Learning ☆46 · Dec 17, 2019 · Updated 6 years ago
- Creating fixed-length vectors to describe RL/GA policies ☆20 · Oct 23, 2021 · Updated 4 years ago
- Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences" ☆333 · Nov 29, 2021 · Updated 4 years ago
- Code for demonstration example-task in RUDDER blog ☆24 · May 19, 2020 · Updated 5 years ago
- ☆20 · Mar 14, 2021 · Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆28 · Dec 7, 2021 · Updated 4 years ago
- Summary of key papers in deep reinforcement learning. Heavily based on OpenAI SpinningUp. ☆84 · Jun 24, 2020 · Updated 5 years ago
- Source code for ZePHyR: Zero-shot Pose Hypothesis Rating @ ICRA 2021 ☆25 · Aug 17, 2022 · Updated 3 years ago
- ☆84 · Nov 19, 2020 · Updated 5 years ago
- Eagerly Experimentable!!! ☆26 · Jan 16, 2021 · Updated 5 years ago
- Experiments with Message Passing GNNs in C++ and PyTorch. ☆26 · Jul 25, 2024 · Updated last year
- An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21 ☆36 · Mar 4, 2023 · Updated 2 years ago
- Implementation of HER algorithm in the bit-flipping environment. ☆17 · Feb 20, 2018 · Updated 8 years ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23) ☆28 · Jan 20, 2024 · Updated 2 years ago