alexander-turner / attainable-utility-preservation
☆12Updated 3 years ago
Alternatives and similar repositories for attainable-utility-preservation:
Users that are interested in attainable-utility-preservation are comparing it to the libraries listed below
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Updated 3 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- ☆9Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆60Updated 3 years ago
- PAIRED in PyTorch 🔥☆59Updated 2 years ago
- ☆80Updated last year
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Experiments in applying interpretability techniques to learned reward functions.☆10Updated 4 years ago
- ☆20Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- HOUDINI: Lifelong Learning as Program Synthesis☆48Updated last month
- Lernd is ∂ILP (dILP) framework implementation based on Deepmind's paper Learning Explanatory Rules from Noisy Data.☆25Updated 2 years ago
- ☆85Updated 4 years ago
- ☆24Updated 6 years ago
- Library to compare and evaluate reward functions☆66Updated last year
- Clockwork VAEs in JAX/Flax☆32Updated 3 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆24Updated 4 years ago
- A job launching library for docker, EC2, GCP, etc.☆57Updated 3 years ago
- ☆79Updated 4 years ago
- Interpretability dashboard for reinforcement learners☆16Updated 5 years ago
- Map-Elites based on Evolution Strategies☆31Updated 3 years ago
- Implementation of https://medium.com/ai-control/alba-an-explicit-proposal-for-aligned-ai-17a55f60bbcf☆27Updated 7 years ago
- Code for "Learning Compositional Rules via Neural Program Synthesis"☆60Updated 4 years ago
- Webppl library for generating Gridworld MDPs. JS library for displaying Gridworld.☆22Updated 8 years ago
- On the pitfalls of measuring emergent communication☆34Updated 6 years ago
- ☆43Updated 7 years ago
- Symbolic Reinforcement Learning using Inductive Logic Programming☆62Updated 2 years ago
- Plannable Approximations to MDP Homomorphisms: Equivariance under Actions☆30Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago