RL Experiments from our paper "Backpropagation Through the Void": https://arxiv.org/abs/1711.00123. Lovingly forked from OpenAI's RL Baseline repo.
☆39 · Feb 21, 2018 · Updated 8 years ago
Alternatives and similar repositories for BackpropThroughTheVoidRL
Users that are interested in BackpropThroughTheVoidRL are comparing it to the libraries listed below.
- Optimizing control variates for black-box gradient estimation ☆163 · Jul 26, 2019 · Updated 6 years ago
- ☆11 · Sep 20, 2016 · Updated 9 years ago
- Optimally-weighted herding is Bayesian Quadrature ☆17 · Jul 8, 2016 · Updated 9 years ago
- Estimating Gradients for Discrete Random Variables by Sampling without Replacement ☆40 · Feb 14, 2020 · Updated 6 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning. ☆17 · Aug 2, 2018 · Updated 7 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 · Feb 14, 2018 · Updated 8 years ago
- Deep GPs with GPy ☆31 · May 18, 2016 · Updated 10 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning. ☆16 · Jan 5, 2021 · Updated 5 years ago
- ☆31 · Oct 26, 2020 · Updated 5 years ago
- Example automatic differentiation code in Scala ☆31 · Jun 22, 2020 · Updated 5 years ago
- Code Released for NeurIPS 2018 paper: Synthesized Policies for Transfer and Adaptation across Tasks and Environments ☆16 · Apr 17, 2019 · Updated 7 years ago
- ReGAN: Sequence GAN using RE[INFORCE|LAX|BAR] based PG estimators ☆41 · May 12, 2018 · Updated 8 years ago
- ☆24 · Feb 3, 2019 · Updated 7 years ago
- ☆80 · Sep 4, 2021 · Updated 4 years ago
- This is my implementation of the Optimality Tightening ☆37 · Apr 26, 2017 · Updated 9 years ago
- Tensorflow implementation of deformable conv and pooling operations. ☆10 · Jul 17, 2017 · Updated 8 years ago
- This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement le… ☆220 · Jul 22, 2019 · Updated 6 years ago
- A Python implementation of the gradient REBAR estimator. ☆47 · Jun 13, 2018 · Updated 7 years ago
- Code for Stochastic Hyperparameter Optimization through Hypernetworks ☆28 · Jun 11, 2018 · Updated 7 years ago
- Berg - Run GPU-backed experiments on gcloud ☆26 · Nov 17, 2018 · Updated 7 years ago
- Ancestral Gumbel-Top-k Sampling ☆25 · Apr 11, 2020 · Updated 6 years ago
- Variational Information Maximization for Feature Selection ☆11 · Aug 24, 2016 · Updated 9 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 · Nov 2, 2017 · Updated 8 years ago
- The PyBullet wrapper (Vat) for Neural Task Programming ☆34 · Apr 24, 2018 · Updated 8 years ago
- Physics Engine in Python ☆15 · Feb 25, 2016 · Updated 10 years ago
- Use deep neural networks to synthesize the Neuroscore for evaluating Generative Adversarial Networks ☆10 · Jun 1, 2020 · Updated 5 years ago
- ☆78 · Sep 18, 2017 · Updated 8 years ago
- Reinforcement learning with Rust ☆14 · Jul 31, 2022 · Updated 3 years ago
- Implementation of Adversarial Variational Optimization in PyTorch ☆42 · Aug 7, 2018 · Updated 7 years ago
- A TensorFlow reimplementation of GalSim ☆10 · Mar 8, 2022 · Updated 4 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆32 · Oct 12, 2017 · Updated 8 years ago
- simple example of gradient-based hyperparameter optimization using tensorflow ☆19 · Feb 29, 2016 · Updated 10 years ago
- Python term rewriting ☆30 · Feb 14, 2013 · Updated 13 years ago
- A collection of python routines to help identify and morph objects. ☆13 · Oct 2, 2021 · Updated 4 years ago
- DNI (Decoupled Neural Interfaces using Synthetic Gradients) Implementation with Tensorflow. ☆28 · Jan 26, 2018 · Updated 8 years ago
- ☆85 · May 29, 2019 · Updated 6 years ago
- Build-to-Order BLAS ☆12 · Apr 9, 2019 · Updated 7 years ago
- Gaussian processes in TensorFlow with modifications to allow inter-domain inducing variables ☆13 · Aug 11, 2017 · Updated 8 years ago
- Interface for deep reinforcement learning ☆24 · Aug 6, 2021 · Updated 4 years ago