neale / avoiding-side-effects
Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for avoiding-side-effects
- ☆12Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 5 months ago
- ☆36Updated last year
- Library to compare and evaluate reward functions☆61Updated last year
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆31Updated 4 years ago
- krazy grid world☆25Updated 4 years ago
- Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020☆25Updated 4 years ago
- ☆21Updated 2 years ago
- Nethack Learning Environment Wrapper for Language Interface☆33Updated last year
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆31Updated 4 years ago
- PAIRED in PyTorch 🔥☆56Updated last year
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 2 years ago
- General Modules for JAX☆58Updated 3 months ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆19Updated 3 years ago
- An Open-Ended Agentic Simulator☆22Updated 2 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆132Updated last year
- ☆85Updated 3 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- ☆28Updated 3 years ago
- Scalable Opponent Shaping Experiments in JAX☆21Updated 6 months ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆78Updated 5 years ago
- impact-driven-exploration☆126Updated last year
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- Reinforcement Learning with Latent Flow☆43Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆17Updated last year