HumanCompatibleAI / interpreting-rewardsLinks
Experiments in applying interpretability techniques to learned reward functions.
☆10Updated 4 years ago
Alternatives and similar repositories for interpreting-rewards
Users that are interested in interpreting-rewards are comparing it to the libraries listed below
Sorting:
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Updated 6 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Updated last year
- Library to compare and evaluate reward functions☆67Updated last year
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- ☆31Updated 2 years ago
- AGAC: Adversarially Guided Actor-Critic☆48Updated 3 years ago
- Python implementation of tabular asynchronous actor critic☆11Updated 9 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 7 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Updated 4 years ago
- A web based platform for collecting human actions in reinforcement learning environments☆30Updated last year
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25Updated 6 years ago
- Generalised UDRL☆37Updated 3 years ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 5 years ago
- Reinforcement Learning via Latent State Decoding☆29Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆62Updated last year
- This repository contains implementations of the paper VUSFA☆14Updated 4 years ago
- Gym wrapper for pysc2☆10Updated 2 years ago
- ☆28Updated 2 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Updated last year
- ☆44Updated 6 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Updated 6 years ago
- ICRL 2020☆19Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 6 years ago
- Collection of in-progress libraries for entity neural networks.☆30Updated 3 years ago