neale / avoiding-side-effects

Code for reproducing the results from the paper "Avoiding Side Effects in Complex Environments"

☆12

Alternatives and similar repositories for avoiding-side-effects

Users who are interested in avoiding-side-effects are comparing it to the repositories listed below.

minqi / wordcraft
An environment for benchmarking commonsense agents
☆29 Updated 4 years ago
alexander-turner / attainable-utility-preservation
☆12 Updated 4 years ago
ucl-dark / paired
PAIRED in PyTorch 🔥
☆62 Updated 2 years ago
real-itu / Evocraft-py
A Python interface for Minecraft built on gRPC
☆125 Updated 3 years ago
facebookresearch / adversarially-motivated-intrinsic-goals
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆63 Updated last year
eaplatanios / jelly-bean-world
A framework for experimenting with never-ending learning
☆79 Updated 9 months ago
mdcrosby / animal-ai
Animal-AI 3
☆65 Updated 2 years ago
HumanCompatibleAI / evaluating-rewards
Library to compare and evaluate reward functions
☆67 Updated last year
eilab-gt / NovGrid
Novelty MiniGrid (NovGrid) is an extension of the MiniGrid environment that allows for the world properties and dynamics to change according …
☆35 Updated last year
pathak22 / modular-assemblies
[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"
☆116 Updated 5 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆88 Updated 4 years ago
bstadie / krazyworld
krazy grid world
☆25 Updated 5 years ago
facebookresearch / icp-block-mdp
Invariant Causal Prediction for Block MDPs
☆44 Updated 5 years ago
uber-research / backpropamine
Train self-modifying neural networks with neuromodulated plasticity
☆77 Updated 5 years ago
sebastianrisi / ga-world-models
☆20 Updated 6 years ago
google-deepmind / dm_fast_mapping
☆54 Updated 3 years ago
machelreid / can-wikipedia-help-offline-rl
Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu
☆105 Updated 3 years ago
Miffyli / nle-sample-factory-baseline
☆22 Updated 4 months ago
google-deepmind / dm_hard_eight
☆84 Updated 4 years ago
ngoodger / nle-language-wrapper
NetHack Learning Environment Wrapper for Language Interface
☆38 Updated last year
bmazoure / ppo_jax
JAX implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆57 Updated 3 years ago
danijar / ninjax
General Modules for JAX
☆66 Updated 4 months ago
mfranzs / meta-learning-curiosity-algorithms
☆80 Updated last year
henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆81 Updated 3 years ago
kkhetarpal / ioc
Options of Interest: Temporal Abstraction with Interest Functions (AAAI 2020)
☆25 Updated 5 years ago
rlai-lab / Regularized-GradientTD
Code repo for the paper "Gradient Temporal-Difference Learning with Regularized Corrections".
☆36 Updated 4 years ago
facebookresearch / impact-driven-exploration
impact-driven-exploration
☆131 Updated last year
ElisevanderPol / symmetrizer
☆31 Updated 4 years ago
AmiiThinks / AlphaEx
A Python Toolkit for Managing a Large Number of Experiments
☆32 Updated last year
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75 Updated 4 years ago