Shallow-Updates-for-Deep-RL / Shallow_Updates_for_Deep_RL
Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"
☆18Updated 7 years ago
Alternatives and similar repositories for Shallow_Updates_for_Deep_RL:
Users that are interested in Shallow_Updates_for_Deep_RL are comparing it to the libraries listed below
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Torch7 impementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆42Updated 9 years ago
- E2C implementation in PyTorch☆43Updated 7 years ago
- Distributed A3C☆34Updated 7 years ago
- The Variational Homoencoder: Learning to learn high capacity generative models from few examples☆34Updated last year
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Updated 8 years ago
- PyTorch implementation of AVF☆45Updated 4 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Code for the paper "Representation Learning for Grounded Spatial Reasoning"☆52Updated 4 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Updated 7 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Implementation of iterative inference in deep latent variable models☆43Updated 5 years ago
- RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Base…☆38Updated 7 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Updated 5 years ago
- ☆56Updated 6 years ago
- A2C for GVG-AI☆21Updated 6 years ago
- Models built with TensorFlow☆25Updated 6 years ago
- Model-Free Episodic Control☆14Updated 8 years ago
- Implementation of Adversarial Variational Optimization in PyTorch☆43Updated 6 years ago
- Code release for the paper "Calibrating Energy-based Generative Adversarial Networks"☆24Updated 7 years ago
- Lagrangian VAE☆28Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Ranking Policy Gradient☆23Updated 5 years ago
- This is my implementation of the Optimality Tightening☆37Updated 7 years ago
- ☆17Updated 7 years ago
- ☆38Updated 8 years ago
- PyTorch implementation of Memory Augmented Self-Play☆50Updated 4 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆10Updated 6 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 2 years ago
- Inferring beliefs about dynamics from behavior☆29Updated 6 years ago