wgrathwohl / BackpropThroughTheVoidRL
RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Baseline repo.
☆38Updated 7 years ago
Alternatives and similar repositories for BackpropThroughTheVoidRL:
Users who are interested in BackpropThroughTheVoidRL are comparing it to the libraries listed below:
- A Python implementation of the REBAR gradient estimator.☆46Updated 6 years ago
- Implementation of Coulomb GANs☆62Updated 3 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Updated 7 years ago
- Torch7 implementation of: Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images☆42Updated 9 years ago
- Lagrangian VAE☆28Updated 6 years ago
- ☆37Updated 5 years ago
- Variational Message Passing for Structured VAE (Code for ICLR 2018 paper)☆44Updated 7 years ago
- ☆17Updated 7 years ago
- Code release for the paper "Calibrating Energy-based Generative Adversarial Networks"☆24Updated 7 years ago
- Predictive State Recurrent Neural Networks☆18Updated 4 years ago
- Implementation of Variational Intrinsic Control in TensorFlow☆11Updated 8 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions☆22Updated 5 years ago
- Optimizing control variates for black-box gradient estimation☆162Updated 5 years ago
- PyTorch implementation of AVF☆45Updated 4 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 7 years ago
- Example implementation of the Bayesian neural network in "Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteri…☆30Updated 4 years ago
- Code accompanying the OptionGAN paper.☆44Updated 6 years ago
- Reimplementation code for the paper "Generative Temporal Models with Spatial Memory for Partially Observed Environments"☆29Updated 2 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Updated 7 years ago
- Implementation of "Variational Inference for Monte Carlo Objectives"☆21Updated 4 years ago
- A TensorFlow implementation of Attend, Infer, Repeat☆81Updated 6 years ago
- Implementation of REBAR in PyTorch☆17Updated 6 years ago
- SCAN: Learning Abstract Hierarchical Compositional Visual Concepts☆54Updated 7 years ago
- Code for "Inference Suboptimality in Variational Autoencoders"☆14Updated 5 years ago
- Code for "Efficient optimization of loops and limits with randomized telescoping sums"☆27Updated 5 years ago
- TensorFlow implementation of "noisy K-FAC" and "noisy EK-FAC".☆60Updated 6 years ago
- A generic Monte Carlo method based on the Gumbel-Max trick.☆32Updated 8 years ago
- Variational Walkback, NIPS'17☆28Updated 7 years ago
- Recurrent Back Propagation, Back Propagation Through Optimization, ICML 2018☆41Updated 6 years ago
- The implementation of "The Kanerva Machine" with PyTorch and Pyro☆12Updated 6 years ago