HumanCompatibleAI / evaluating-rewards
Library to compare and evaluate reward functions
☆67 Updated last year
Alternatives and similar repositories for evaluating-rewards
Users who are interested in evaluating-rewards are comparing it to the libraries listed below.
- PAIRED in PyTorch 🔥 ☆60 Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 Updated last year
- impact-driven-exploration ☆131 Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 3 years ago
- A collection of RL algorithms written in JAX. ☆98 Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic ☆49 Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆68 Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆80 Updated 2 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL ☆87 Updated 6 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 Updated last year
- Invariant Causal Prediction for Block MDPs ☆44 Updated 4 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated last year
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020 ☆25 Updated 4 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ☆74 Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆81 Updated 2 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re… ☆106 Updated 3 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆70 Updated last year
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ☆84 Updated 5 years ago
- Revisiting Rainbow ☆75 Updated 3 years ago
- ☆86 Updated 10 months ago
- ☆45 Updated 2 years ago
- ☆112 Updated 2 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning" ☆46 Updated last year
- JAX implementations of core Deep RL algorithms ☆79 Updated 3 years ago
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit… ☆190 Updated 3 years ago
- MultiTask Environments for Reinforcement Learning. ☆75 Updated 2 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 Updated last year
- Baselines for gymnax 🤖 ☆66 Updated 2 years ago
- ☆86 Updated 3 years ago