david-abel / rl_info_theoryLinks
A collection of code investigating the use of information theory for abstractions in RL
☆16Updated 6 years ago
Alternatives and similar repositories for rl_info_theory
Users that are interested in rl_info_theory are comparing it to the libraries listed below
Sorting:
- ☆35Updated 6 years ago
- Simple tools for statistical analyses in RL experiments☆67Updated 7 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆96Updated 4 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play☆52Updated 4 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆96Updated 6 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 6 years ago
- A simple Gridworld environment for Open AI gym☆25Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- ☆80Updated last year
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Reward Learning by Simulating the Past☆44Updated 6 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 8 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Updated 6 years ago
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Updated 5 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 4 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆19Updated 5 years ago
- A2C for GVG-AI☆22Updated 6 years ago
- Inferring beliefs about dynamics from behavior☆29Updated 7 years ago
- E2C implementation in PyTorch☆43Updated 8 years ago
- This is my implementation of the Optimality Tightening☆37Updated 8 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)☆24Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 3 years ago