PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆61Updated 4 years ago
Alternatives and similar repositories for safelife
Users who are interested in safelife are comparing it to the libraries listed below.
- Adapting the AlphaZero algorithm to remove the need for execution traces to train NPI.☆79Updated 2 years ago
- Upside-Down Reinforcement Learning (⅂ꓤ) implementation in PyTorch. Based on the paper published by Jürgen Schmidhuber.☆77Updated 5 years ago
- ☆80Updated 2 years ago
- ☆84Updated 5 years ago
- Training (hopefully) safe agents in gridworlds☆25Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆64Updated 2 years ago
- An interface with micropolis for city-building agents, packaged as an OpenAI gym environment☆158Updated 8 months ago
- Reinforcement learning algorithms☆41Updated 6 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆78Updated 6 years ago
- PyTorch code to train and evaluate Procgen tasks☆25Updated 5 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆84Updated 5 years ago
- This project was moved to: https://github.com/coax-dev/coax☆161Updated 3 years ago
- Reward Learning by Simulating the Past☆46Updated 6 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Updated 5 years ago
- 2019 talk at GECCO☆68Updated 6 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆33Updated 6 years ago
- PyTorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 7 years ago
- Implementation of Model-Agnostic Meta-Learning (MAML) in Jax☆191Updated 3 years ago
- Generic reinforcement learning codebase in TensorFlow☆95Updated 4 years ago
- A set of 13 diverse machine-learning tasks that require memory to solve.☆224Updated 4 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it