Library to compare and evaluate reward functions
☆67Oct 23, 2023Updated 2 years ago
Alternatives and similar repositories for evaluating-rewards
Users that are interested in evaluating-rewards are comparing it to the libraries listed below
Sorting:
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆28Dec 7, 2021Updated 4 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆110Jan 23, 2022Updated 4 years ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆78Dec 5, 2023Updated 2 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- ☆16Nov 27, 2016Updated 9 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- ☆31Feb 20, 2021Updated 5 years ago
- Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"☆17Apr 9, 2024Updated last year
- ☆18Mar 28, 2023Updated 2 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆34Mar 29, 2023Updated 2 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Clockwork VAEs in JAX/Flax☆32Jul 16, 2021Updated 4 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆18Jul 28, 2018Updated 7 years ago
- ☆58Jun 30, 2022Updated 3 years ago
- ☆20Mar 14, 2021Updated 4 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆56Apr 27, 2020Updated 5 years ago
- A mini library for Policy Gradients with Parameter-based Exploration, with reference implementation of the ClipUp optimizer (https://arxi…☆73Dec 10, 2020Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆88Jan 22, 2019Updated 7 years ago
- ☆28Jan 11, 2021Updated 5 years ago
- Codes for Evolving Plastic ANNs☆14Dec 18, 2022Updated 3 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Nov 24, 2022Updated 3 years ago
- ☆10Mar 10, 2021Updated 4 years ago
- Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020☆221Jun 5, 2023Updated 2 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- Guided-Meta Policy Search☆39Jan 19, 2023Updated 3 years ago
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Mar 15, 2021Updated 4 years ago
- ☆44Oct 27, 2018Updated 7 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Nov 15, 2018Updated 7 years ago
- ☆12Jul 22, 2021Updated 4 years ago
- ☆11Sep 11, 2020Updated 5 years ago
- Automatic Recall Machines: Internal Replay, Continual Learning and the Brain☆11Jul 14, 2020Updated 5 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆46Sep 20, 2023Updated 2 years ago