jvmncs / safe-grid-agents
Training (hopefully) safe agents in gridworlds
☆25Updated 5 years ago
Alternatives and similar repositories for safe-grid-agents:
Users that are interested in safe-grid-agents are comparing it to the libraries listed below
- Interpretability dashboard for reinforcement learners☆16Updated 5 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆93Updated 6 years ago
- The Machine Learning Toybox for testing the behavior of autonomous agents.☆27Updated 2 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆31Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- ☆43Updated 5 years ago
- Reward Learning by Simulating the Past☆44Updated 5 years ago
- Train self-modifying neural networks with neuromodulated plasticity☆76Updated 5 years ago
- Training Sonic with RLlib☆57Updated last year
- ☆85Updated 4 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆60Updated 3 years ago
- ☆44Updated 6 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆21Updated 3 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 6 years ago
- An environment for benchmarking commonsense agents☆28Updated 4 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Code release for Learning with Opponent-Learning Awareness and variations.☆146Updated last year
- ☆80Updated last year
- Collection of tutorials, exercises and papers on RL☆17Updated 7 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Some hard problems for reinforcement learning.☆32Updated 6 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn.☆17Updated 6 years ago
- ☆20Updated 5 years ago
- Mixture Density Networks (Bishop, 1994) tutorial in JAX☆58Updated 4 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- Tensor Based Environment Framework for Training RL Agents - Pre Alpha☆8Updated 4 years ago
- Our NIPS 2017: Learning to Run source code☆55Updated last year
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago