PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆59Updated 3 years ago
Related projects: ⓘ
- ☆80Updated 11 months ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆77Updated 11 months ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆76Updated 4 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 4 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆30Updated 5 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- ☆12Updated 3 years ago
- Reinforcement learning algorithms☆39Updated 5 years ago
- PyTorch code to train and evaluate Procgen tasks☆23Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- ☆85Updated 3 years ago
- Training (hopefully) safe agents in gridworlds☆25Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆85Updated 5 years ago
- Library to compare and evaluate reward functions☆61Updated 10 months ago
- ☆44Updated 5 years ago
- Starter Kit for NeurIPS 2020 - Procgen Competition on AIcrowd☆88Updated last year
- Augmented environments with RL☆102Updated 5 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆37Updated 2 years ago
- PAIRED in PyTorch 🔥☆56Updated last year
- Train self-modifying neural networks with neuromodulated plasticity☆77Updated 4 years ago
- ☆27Updated 2 years ago
- On the pitfalls of measuring emergent communication☆33Updated 5 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆113Updated 4 years ago
- fork of rl-baseline-zoo☆21Updated 4 years ago
- Cellular automaton-based calculus for the masses☆23Updated 6 years ago
- Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al.☆13Updated 3 years ago
- ☆20Updated 5 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆83Updated 4 years ago