david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆16Updated 6 years ago
Alternatives and similar repositories for rl_info_theory:
Users that are interested in rl_info_theory are comparing it to the libraries listed below
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Updated 4 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Updated 6 years ago
- ☆35Updated 6 years ago
- Variational Reinforcement Learning☆16Updated 7 months ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Simple tools for statistical analyses in RL experiments☆66Updated 6 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Updated 6 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆18Updated 5 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- ☆44Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 5 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 2 years ago
- ICRL 2020☆19Updated 5 years ago
- Code for "Learning Inductive Biases with Simple Neural Networks" (Feinman & Lake, 2018).☆21Updated 6 years ago
- Implementation of Neural Episodic Control in Tensorflow☆26Updated 5 years ago
- Inferring beliefs about dynamics from behavior☆29Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆25Updated 2 years ago
- Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)☆10Updated 4 years ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 5 years ago
- Distributed DDPG implementation in pytorch☆9Updated 6 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 7 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆21Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".☆35Updated 6 years ago