HumanCompatibleAI/interpreting-rewards

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HumanCompatibleAI/interpreting-rewards)

HumanCompatibleAI / interpreting-rewards

Experiments in applying interpretability techniques to learned reward functions.

☆10

Alternatives and similar repositories for interpreting-rewards

Users that are interested in interpreting-rewards are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Stanford-ILIAD / ILEED
View on GitHub
Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"
☆11Jul 5, 2023Updated 3 years ago
smearle / autoverse
View on GitHub
Generative cellular automaton-like learning environments for RL.
☆20Jan 30, 2025Updated last year
qxcv / magical
View on GitHub
The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)
☆78Dec 5, 2023Updated 2 years ago
HumanCompatibleAI / atari-irl
View on GitHub
☆28Mar 13, 2019Updated 7 years ago
HumanCompatibleAI / seals
View on GitHub
Benchmark environments for reward modelling and imitation learning algorithms.
☆47Sep 19, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HuangJiaLian / AIRL_MountainCar
View on GitHub
Adversarial Inverse Reinforcement Learning Implement For Mountain Car
☆36Sep 21, 2021Updated 4 years ago
neale / avoiding-side-effects
View on GitHub
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Jun 3, 2021Updated 5 years ago
HumanCompatibleAI / population-irl
View on GitHub
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆27Jun 20, 2019Updated 7 years ago
HumanCompatibleAI / learning_biases
View on GitHub
Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.
☆25Sep 26, 2020Updated 5 years ago
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
mixxorz / GarageBard
View on GitHub
An app for macOS that lets you play MIDI files as a bard on Final Fantasy XIV.
☆15Oct 27, 2022Updated 3 years ago
qiaoguanren / Multi-Modal-Inverse-Constrained-Reinforcement-Learning
View on GitHub
NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement
☆13Feb 19, 2024Updated 2 years ago
HumanCompatibleAI / evaluating-rewards
View on GitHub
Library to compare and evaluate reward functions
☆69Oct 23, 2023Updated 2 years ago
tsujuifu / pytorch_bco
View on GitHub
A PyTorch implementation of BCO
☆12Jun 19, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
kywch / brax-trainer
View on GitHub
Brax + Pufferlib + CARBS for gpu-accelerated robotics RL
☆12Jun 12, 2025Updated last year
vwxyzjn / gym-pysc2
View on GitHub
Gym wrapper for pysc2
☆10Sep 16, 2022Updated 3 years ago
Farama-Foundation / Procgen-Staging
View on GitHub
Procgen2: A community maintained fork of procgen
☆12Aug 25, 2022Updated 3 years ago
gioramponi / sigma-girl-MIIRL
View on GitHub
Code of Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions
☆13May 22, 2023Updated 3 years ago
HumanCompatibleAI / rlsp
View on GitHub
Reward Learning by Simulating the Past
☆46May 9, 2019Updated 7 years ago
kach / acting-as-inverse-inverse-planning
View on GitHub
Code for our SIGGRAPH 2023 paper, "Acting as Inverse Inverse Planning"
☆20Apr 21, 2023Updated 3 years ago
MarkFzp / infogail-pomdp
View on GitHub
Multi-Modal Imitation Learning in Partially Observable Environments
☆14Sep 5, 2020Updated 5 years ago
malayandi / DemPrefCode
View on GitHub
Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"
☆12May 20, 2019Updated 7 years ago
koulanurag / dream-and-search
View on GitHub
Code for "Dream and Search to Control: Latent Space Planning for Continuous Control"
☆12Jul 12, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
watcl-lab / cs886-winter-2025
View on GitHub
CS886: Graph Neural Networks
☆14Mar 28, 2025Updated last year
xlnwel / grl
View on GitHub
General-Purpose Reinforcement Learning
☆18Oct 31, 2021Updated 4 years ago
emd4600 / SporeModder
View on GitHub
A modding tool for the videogame Spore.
☆12Jul 24, 2020Updated 5 years ago
mast-group / codemining-core
View on GitHub
A set of tools for extracting tokens and ASTs from code
☆22Jun 5, 2018Updated 8 years ago
hcmlab / GANterfactual-RL
View on GitHub
Counterfactual explanations for Reinforcement Learning agents on Atari
☆12Apr 3, 2023Updated 3 years ago
EoinKenny / Prototype-Wrapper-Network-ICLR23
View on GitHub
☆12Dec 15, 2024Updated last year
heddendorp / tumi-app
View on GitHub
This project is now at heddendorp/tumi
☆12Apr 14, 2022Updated 4 years ago
OpenBYOND / OpenBYOND
View on GitHub
OpenBYOND for C#
☆11Oct 13, 2014Updated 11 years ago
subarnop / CapsNet-Fashion-MNIST
View on GitHub
Capsule Network for classification of Fashion-MNIST dataset.
☆10Nov 6, 2017Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mueller91 / labelfix
View on GitHub
Identifying Mislabeled Instances inClassification Datasets
☆29Nov 21, 2022Updated 3 years ago
yz93 / Learn-to-Interpret-Atari-Agents
View on GitHub
☆11Feb 20, 2020Updated 6 years ago
robometer / robometer-policy-learning
View on GitHub
Policy learning framework that uses Robometer reward model
☆15Jun 16, 2026Updated last month
braathwaate / strategoevaluator
View on GitHub
Manager program for stratego game bot interplay.
☆15Apr 16, 2015Updated 11 years ago
luchris429 / discovered-policy-optimisation
View on GitHub
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12Jun 15, 2023Updated 3 years ago
vivekmyers / horizon_generalization
View on GitHub
☆15Feb 5, 2025Updated last year
chrisjsewell / pytest-notebook
View on GitHub
A pytest plugin for regression testing and regenerating Jupyter Notebooks
☆51Updated this week