gavlegoat / safe-learning
☆17 · Updated last year
Related projects
Alternatives and complementary repositories for safe-learning
- ☆39 · Updated last year
- ☆62 · Updated 9 months ago
- Logically-Constrained Reinforcement Learning ☆53 · Updated 4 months ago
- A gym interface for AI safety gridworlds created in pycolab. ☆17 · Updated 2 years ago
- Object Centric Atari games ☆48 · Updated this week
- Code for the paper: "Causal Influence Detection for Improving Efficiency in Reinforcement Learning", by Seitzer, M., Schölkopf, B., Marti… ☆36 · Updated 2 years ago
- Code and data for the paper "Bridging RL Theory and Practice with the Effective Horizon" ☆42 · Updated 4 months ago
- ☆34 · Updated last year
- Codes for the study "Variational Recurrent Models for Solving Partially Observable Control Tasks", published as a conference paper at ICL… ☆50 · Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms ☆133 · Updated last year
- Benchmarking RL generalization in an interpretable way. ☆132 · Updated 9 months ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning ☆48 · Updated 3 years ago
- ☆36 · Updated last year
- ☆33 · Updated 2 months ago
- LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization ☆32 · Updated last year
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning" ☆44 · Updated last year
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1 ☆15 · Updated 4 years ago
- ☆28 · Updated last year
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL ☆23 · Updated 2 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning ☆55 · Updated last year
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆46 · Updated last year
- ☆26 · Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination ☆25 · Updated last year
- Simple maze environments using mujoco-py ☆52 · Updated 10 months ago
- ☆54 · Updated 8 months ago
- ☆15 · Updated 3 months ago
- Gym-like extensions for POMDP ☆56 · Updated 3 years ago
- This is a minimal example to demonstrate how multi-agent reinforcement learning with differentiable communication channels and centralize… ☆40 · Updated last year
- ☆28 · Updated last year
- A collection of RL algorithms written in JAX. ☆95 · Updated 2 years ago