alexander-turner / attainable-utility-preservation

☆12

Alternatives and similar repositories for attainable-utility-preservation

Users who are interested in attainable-utility-preservation compare it to the repositories listed below.

neale / avoiding-side-effects
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12 Updated 4 years ago
minqi / wordcraft
An environment for benchmarking commonsense agents
☆29 Updated 4 years ago
PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆60 Updated 4 years ago
crunchiness / lernd
Lernd is a ∂ILP (dILP) framework implementation based on DeepMind's paper Learning Explanatory Rules from Noisy Data.
☆24 Updated 2 years ago
paulfchristiano / amplification
☆9 Updated 5 years ago
trishullab / houdini
HOUDINI: Lifelong Learning as Program Synthesis
☆48 Updated 3 months ago
mfranzs / meta-learning-curiosity-algorithms
☆80 Updated last year
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆63 Updated last year
koulanurag / mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
☆50 Updated 2 years ago
HumanCompatibleAI / evaluating-rewards
Library to compare and evaluate reward functions
☆67 Updated last year
sebastianrisi / ga-world-models
☆20 Updated 6 years ago
google-deepmind / cartesian-frames
A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover.
☆19 Updated 3 years ago
RichardEvans / apperception
☆80 Updated 4 years ago
samacqua / LARC
Language-annotated Abstraction and Reasoning Corpus
☆88 Updated 2 years ago
maciej-sypetkowski / autoascend
The first-place solution for the NeurIPS 2021 NetHack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge
☆59 Updated 2 years ago
andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16 Updated 6 years ago
ucl-dark / paired
PAIRED in PyTorch 🔥
☆62 Updated 2 years ago
eaplatanios / jelly-bean-world
A framework for experimenting with never-ending learning
☆79 Updated 9 months ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.
☆79 Updated last year
921kiyo / symbolic-rl
Symbolic Reinforcement Learning using Inductive Logic Programming
☆62 Updated 2 years ago
uber-research / backpropamine
Train self-modifying neural networks with neuromodulated plasticity
☆77 Updated 5 years ago
andyljones / boardlaw
Scaling scaling laws with board games.
☆49 Updated 2 years ago
google-deepmind / dm_hard_eight
☆84 Updated 4 years ago
mtensor / rulesynthesis
Code for "Learning Compositional Rules via Neural Program Synthesis"
☆60 Updated 4 years ago
brain-research / mirage-rl
Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.
☆17 Updated 6 years ago
louiskirsch / vsml-neurips2021
Code for "Meta Learning Backpropagation And Improving It" @ NeurIPS 2021 https://arxiv.org/abs/2012.14905
☆33 Updated 3 years ago
mtrazzi / gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆18 Updated 6 years ago
ericjang / maml-jax
Implementation of Model-Agnostic Meta-Learning (MAML) in Jax
☆191 Updated 2 years ago
redwoodresearch / interp
Redwood Research's transformer interpretability tools
☆14 Updated 3 years ago
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆44 Updated 6 years ago