HumanCompatibleAI / evaluating-rewards
Library to compare and evaluate reward functions
★65 · Updated last year
Alternatives and similar repositories for evaluating-rewards:
Users who are interested in evaluating-rewards compare it to the libraries listed below.
- PAIRED in PyTorch 🔥 ★58 · Updated 2 years ago
- impact-driven-exploration ★130 · Updated last year
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ★61 · Updated last year
- Modifiable OpenAI Gym environments for studying generalization in RL ★87 · Updated 6 years ago
- Invariant Causal Prediction for Block MDPs ★44 · Updated 4 years ago
- Revisiting Rainbow ★74 · Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ★84 · Updated 3 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ★68 · Updated 3 years ago
- Convert DeepMind Control Suite to OpenAI gym environments. ★83 · Updated 5 years ago
- Reinforcement Learning with Latent Flow ★43 · Updated 3 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ★69 · Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning ★55 · Updated last year
- Efficient Exploration via State Marginal Matching (2019) ★67 · Updated 5 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL. ★69 · Updated last year
- JAX implementations of core Deep RL algorithms ★79 · Updated 2 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior" ★83 · Updated 5 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ★100 · Updated 2 years ago
- A collection of RL algorithms written in JAX. ★95 · Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights… ★53 · Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ★82 · Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ★79 · Updated 2 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re… ★104 · Updated 3 years ago
- ★43 · Updated last year
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020 ★25 · Updated 4 years ago
- Baselines for gymnax 🤖 ★66 · Updated last year
- ★85 · Updated 4 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ★35 · Updated 9 months ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ★49 · Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ★110 · Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ★80 · Updated 2 years ago