Experiments in applying interpretability techniques to learned reward functions.
☆10Dec 11, 2020Updated 5 years ago
Alternatives and similar repositories for interpreting-rewards
Users that are interested in interpreting-rewards are comparing it to the libraries listed below
Sorting:
- Companion code for ICML 2022 paper "Imitation Learning by Estimating Expertise of Demonstrators"☆11Jul 5, 2023Updated 2 years ago
- Generative cellular automaton-like learning environments for RL.☆20Jan 30, 2025Updated last year
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆78Dec 5, 2023Updated 2 years ago
- Benchmark environments for reward modelling and imitation learning algorithms.☆46Sep 19, 2023Updated 2 years ago
- A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover.☆20Dec 20, 2021Updated 4 years ago
- Reward Learning by Simulating the Past☆46May 9, 2019Updated 6 years ago
- ☆22Sep 9, 2021Updated 4 years ago
- ☆22Jan 14, 2026Updated last month
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆25Sep 26, 2020Updated 5 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Jun 20, 2019Updated 6 years ago
- Library to compare and evaluate reward functions☆67Oct 23, 2023Updated 2 years ago
- ☆28Mar 13, 2019Updated 6 years ago
- NeurIPS[2023] "Multi-Modal Inverse Constrained Reinforcement Learning from a Mixture of Demonstrations" official implement☆10Feb 19, 2024Updated 2 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Aug 29, 2019Updated 6 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- Identifying Mislabeled Instances inClassification Datasets☆29Nov 21, 2022Updated 3 years ago
- Adversarial Inverse Reinforcement Learning Implement For Mountain Car☆36Sep 21, 2021Updated 4 years ago
- Reinforcement learning benchmarking.☆39Oct 22, 2018Updated 7 years ago
- 🦞 A curated list of Molt ecosystem services, platforms, and tools for AI agents — Moltbook, MoltCities, Molthunt, MoltMatch, and more.☆26Updated this week
- A JAX-accelerated implementation of the Procedural Content Generation via Reinforcement Learning (PCGRL) framework. We train RL agents to…☆12Nov 26, 2025Updated 3 months ago
- Active Inference & Category Theory☆10Mar 11, 2024Updated last year
- This repository provides a Python package so everyone can easily try Computer Use of ClaudeAI.☆10Nov 1, 2024Updated last year
- Tools and models for estimating Filecoin energy use from on-chain proofs☆11Jun 14, 2024Updated last year
- Project Gold ✨☆11Jan 29, 2026Updated last month
- A Cython library to solve the Bittensor registration POW on CUDA☆15Aug 15, 2025Updated 6 months ago
- Code used in the analyses described in "Personalized brain circuit scores identify clinically distinct biotypes in depression and anxiety…☆11May 4, 2024Updated last year
- ☆12Feb 16, 2024Updated 2 years ago
- Terrier's desktop search demo product☆13Aug 2, 2018Updated 7 years ago
- This project focuses on using deep learning to replace text in images while retaining the same font and style.☆10Dec 9, 2019Updated 6 years ago
- Adaptive Neuro-Symbolic Network Agent☆41Jun 11, 2022Updated 3 years ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47May 31, 2024Updated last year
- ☆11Mar 13, 2023Updated 2 years ago
- A small somewhat risk-like game. It is based upon a physics simulation describing an elastic graph.☆11Apr 13, 2021Updated 4 years ago
- A textbook on informal homotopy type theory -- Vladimir's fork, retained because there is a pull request based on it.☆11Feb 28, 2015Updated 10 years ago
- Coq集合模型论☆11Aug 18, 2022Updated 3 years ago
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- SEAGE (Search Agents) is a hyper-heuristic framework for metaheuristic collaboration.☆11Oct 14, 2023Updated 2 years ago
- A new language for optimization☆13May 17, 2021Updated 4 years ago
- Code for Max-Margin Deep Generative Models☆12Jan 1, 2015Updated 11 years ago