ivegner / PyDSRL
Faithful Python implementation of the paper "Towards Deep Symbolic Reinforcement Learning" by Garnelo et al.
☆13Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for PyDSRL
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 3 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆24Updated 5 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- ☆44Updated 5 years ago
- ☆42Updated 7 years ago
- ☆80Updated last year
- Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)☆74Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic☆47Updated 3 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- ☆35Updated 6 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Implementation of the Box-World environment from the paper "Relational Deep Reinforcement Learning"☆44Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆31Updated 4 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆35Updated 5 years ago
- Code implementation of: "Graying the black box: Understanding DQNs"☆19Updated 7 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆48Updated 2 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆53Updated 4 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- Inferring beliefs about dynamics from behavior☆28Updated 6 years ago
- Code for the CoRL 2019 paper AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers☆24Updated last year
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆101Updated last year
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- krazy grid world☆25Updated 4 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆78Updated last year
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago