jvmncs / safe-grid-agents
Training (hopefully) safe agents in gridworlds
☆25 Updated 6 years ago
Alternatives and similar repositories for safe-grid-agents
Users who are interested in safe-grid-agents are comparing it to the libraries listed below.
- Interpretability dashboard for reinforcement learners ☆16 Updated 6 years ago
- Some hard problems for reinforcement learning. ☆32 Updated 6 years ago
- Tensor Based Environment Framework for Training RL Agents - Pre Alpha ☆8 Updated 5 years ago
- A gym environment for Stuart Armstrong's model of a treacherous turn. ☆18 Updated 6 years ago
- ☆85 Updated 4 years ago
- MXNet Implementation of DeepMind's Neural Arithmetic Logic Units (NALU) ☆18 Updated 6 years ago
- ☆43 Updated 5 years ago
- SafeLife: safety benchmarks for reinforcement learning agents ☆60 Updated 4 years ago
- Neural Arithmetic Logic Units (arXiv:1808.00508) ☆12 Updated 6 years ago
- Training Sonic with RLlib ☆59 Updated 2 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098) ☆95 Updated 6 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals". ☆61 Updated last year
- Mixture Density Networks (Bishop, 1994) tutorial in JAX ☆59 Updated 5 years ago
- ☆22 Updated 6 years ago
- Tensorflow implementation of Neural Arithmetic Logic Unit, Trask et al. ☆28 Updated 6 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019. ☆13 Updated 4 years ago
- Deep RL Bootcamp solutions ☆35 Updated 7 years ago
- Reward Learning by Simulating the Past ☆44 Updated 6 years ago
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165) ☆21 Updated 3 years ago
- Models built with TensorFlow ☆25 Updated 6 years ago
- Inferring beliefs about dynamics from behavior ☆29 Updated 7 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI. ☆79 Updated last year
- Collection of tutorials, exercises and papers on RL ☆17 Updated 7 years ago
- Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks ☆50 Updated 2 years ago
- ☆20 Updated 5 years ago
- ☆24 Updated 9 years ago
- Actor Critic using Kronecker-Factored Trust Region ☆19 Updated 6 years ago
- Basic pytorch implementation of NAC/NALU from Neural Arithmetic Logic Units paper by Trask et al. ☆115 Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 Updated 6 years ago
- TensorFlow implementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images ☆65 Updated 9 years ago