RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Baseline repo.
☆39Feb 21, 2018Updated 8 years ago
Alternatives and similar repositories for BackpropThroughTheVoidRL
Users that are interested in BackpropThroughTheVoidRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Optimizing control variates for black-box gradient estimation☆163Jul 26, 2019Updated 6 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions☆22May 5, 2019Updated 6 years ago
- ☆11Sep 20, 2016Updated 9 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement☆40Feb 14, 2020Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Deep GPs with GPy☆31May 18, 2016Updated 9 years ago
- ☆30Oct 26, 2020Updated 5 years ago
- ☆18Mar 10, 2017Updated 9 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆17Jan 5, 2021Updated 5 years ago
- blender based random procedural object generation for bullet grasping☆39Sep 27, 2019Updated 6 years ago
- Active Imitation Learing with Noisy Guidance☆10May 29, 2020Updated 5 years ago
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments☆16Apr 17, 2019Updated 6 years ago
- ReGAN: Sequence GAN using RE[INFORCE|LAX|BAR] based PG estimators☆41May 12, 2018Updated 7 years ago
- ☆24Feb 3, 2019Updated 7 years ago
- Code for "Learning Inductive Biases with Simple Neural Networks" (Feinman & Lake, 2018).☆22Jan 8, 2019Updated 7 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- Tensorflow implementation of deformable conv and pooling operations.☆10Jul 17, 2017Updated 8 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le…☆219Jul 22, 2019Updated 6 years ago
- A Python implementation of the gradient REBAR estimator.☆47Jun 13, 2018Updated 7 years ago
- Code for Stochastic Hyperparameter Optimization through Hypernetworks☆28Jun 11, 2018Updated 7 years ago
- Berg - Run GPU-backed experiments on gcloud☆26Nov 17, 2018Updated 7 years ago
- ☆10Mar 28, 2022Updated 3 years ago
- Ancestral Gumbel-Top-k Sampling☆25Apr 11, 2020Updated 5 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- A Gym environment for RLE (Retro Learning Environment)☆15Apr 26, 2018Updated 7 years ago
- Physics Engine in Python☆15Feb 25, 2016Updated 10 years ago
- ☆13Apr 28, 2019Updated 6 years ago
- ☆78Sep 18, 2017Updated 8 years ago
- ☆31May 8, 2017Updated 8 years ago
- Reinforcement learning with Rust☆14Jul 31, 2022Updated 3 years ago
- Implementation of Adversarial Variational Optimization in PyTorch☆42Aug 7, 2018Updated 7 years ago
- FPsolve: solver for polynomial equations over omega-continuous semirings☆11Aug 15, 2015Updated 10 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow☆19Feb 29, 2016Updated 10 years ago
- Python term rewriting☆30Feb 14, 2013Updated 13 years ago
- A collection of python routines to help identify and morph objects.☆13Oct 2, 2021Updated 4 years ago
- Accompanying code for Aljadeff et al., 'Analysis of Neuronal Spike Trains, Deconstructed', Neuron (2016)☆11Jul 25, 2016Updated 9 years ago