jvmncs / safe-grid-agentsLinks

Training (hopefully) safe agents in gridworlds

☆25

Alternatives and similar repositories for safe-grid-agents

Users that are interested in safe-grid-agents are comparing it to the libraries listed below

Sorting:

PartnershipOnAI / safelife
SafeLife: safety benchmarks for reinforcement learning agents
☆60Updated 4 years ago
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆44Updated 6 years ago
mtrazzi / gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
☆18Updated 6 years ago
SuReLI / dyna-gym
This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆32Updated 6 years ago
andrewschreiber / agent
Interpretability dashboard for reinforcement learners
☆16Updated 6 years ago
OpenMined / CampX
Tensor Based Environment Framework for Training RL Agents - Pre Alpha
☆8Updated 5 years ago
hardmaru / gecco-tutorial-2019
2019 talk at GECCO
☆68Updated 6 years ago
alexis-jacq / LOLA_DiCE
Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)
☆95Updated 6 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21Updated 7 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning reinforcement learning.
☆51Updated 6 years ago
koulanurag / mmn
Moore Machine Networks (MMN): Learning Finite-State Representations of Recurrent Policy Networks
☆50Updated 2 years ago
openai / sonic-on-ray
Training Sonic with RLlib
☆59Updated 2 years ago
llealgt / NALUs
Neural Arithmetic Logic Units(arXiv:1808.00508)
☆11Updated 6 years ago
jeappen / gym-grid
A simple Gridworld environment for Open AI gym
☆25Updated 7 years ago
JohnLangford / RL_acid
Some hard problems for reinforcement learning.
☆31Updated 6 years ago
ehknight / natural-gradient-deep-q-learning
☆22Updated 6 years ago
google-deepmind / symplectic-gradient-adjustment
A colab that implements the Symplectic Gradient Adjustment optimizer from "The mechanics of n-player differentiable games"
☆153Updated 6 years ago
alok / rl_implementations
Reinforcement learning algorithm implementations and ML experimentation workspace
☆43Updated 6 years ago
ericjang / maml-jax
Implementation of Model-Agnostic Meta-Learning (MAML) in Jax
☆191Updated 2 years ago
mfranzs / meta-learning-curiosity-algorithms
☆80Updated last year
pmlg / deep-rl-bootcamp
Deep RL Bootcamp solutions
☆35Updated 7 years ago
zuoxingdong / ML-LaTeX-Shortcuts
A collection of LaTeX shortcuts for commonly used mathematical expressions in machine learning.
☆58Updated 6 years ago
paintception / Deep-Quality-Value-DQV-Learning-
DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm
☆25Updated 2 years ago
RL-Group / tutorials-and-papers
Collection of tutorials, exercises and papers on RL
☆17Updated 7 years ago
hardmaru / mdn_jax_tutorial
Mixture Density Networks (Bishop, 1994) tutorial in JAX
☆60Updated 5 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15Updated 6 years ago
Feryal / craft-env
☆44Updated 6 years ago
jachiam / surprise
Surprise-based intrinsic motivation for deep reinforcement learning
☆20Updated 8 years ago
instadeepai / AlphaNPI
Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.
☆79Updated last year
attentionagent / attentionagent.github.io
Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)
☆21Updated 3 years ago