HumanCompatibleAI / interpreting-rewards
Experiments in applying interpretability techniques to learned reward functions.
☆9Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for interpreting-rewards
- Benchmark environments for reward modelling and imitation learning algorithms.☆44Updated last year
- Library to compare and evaluate reward functions☆61Updated last year
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 5 years ago
- Map-Elites based on Evolution Strategies☆31Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- ☆28Updated 5 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 4 years ago
- Hierarchical Self-Play☆21Updated 5 years ago
- ☆71Updated 5 months ago
- Reinforcement learning algorithms in RLlib☆56Updated 6 months ago
- ☆32Updated 6 years ago
- TeachMyAgent is a testbed platform for Automatic Curriculum Learning methods in Deep RL.☆67Updated last year
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play☆14Updated 6 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆24Updated 5 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 6 months ago
- ☆44Updated 5 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 4 years ago
- ☆91Updated 3 years ago
- (This repository is no longer being maintained.) Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for efficientl…☆27Updated 5 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆43Updated last year
- IV-RL - Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation☆36Updated 3 weeks ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆67Updated 3 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆87Updated 3 months ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆25Updated 6 years ago
- Using RLLib and PycoLab to explore intelligent cooperative behavior in sequential social dilemmas☆49Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆27Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated last year
- A curated list of awesome Inverse Reinforcement Learning resources.☆38Updated 2 years ago