Stein Variational Policy Gradient for REINFORCE
☆18 · Jul 12, 2017 · Updated 8 years ago
Alternatives and similar repositories for svpg_REINFORCE
Users that are interested in svpg_REINFORCE are comparing it to the libraries listed below.
- ☆10 · Apr 2, 2018 · Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates ☆33 · Feb 12, 2018 · Updated 8 years ago
- A PyTorch implementation of Amortized Stein Variational Gradient Descent / Stein GAN ☆18 · Dec 13, 2018 · Updated 7 years ago
- Experiments of amortized Stein variational gradient ☆16 · Apr 30, 2017 · Updated 8 years ago
- ☆13 · Jun 23, 2017 · Updated 8 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0 ☆12 · Sep 11, 2019 · Updated 6 years ago
- TensorFlow implementation of Stein Variational Gradient Descent (SVGD) ☆26 · Jan 13, 2018 · Updated 8 years ago
- Parser for files in OpenDRIVE format, offers additional functions to navigate through the road network ☆12 · Sep 6, 2017 · Updated 8 years ago
- ICML 2018 Self-Imitation Learning ☆275 · Apr 18, 2020 · Updated 5 years ago
- Code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm" ☆421 · Mar 21, 2024 · Updated 2 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels ☆10 · Feb 13, 2020 · Updated 6 years ago
- Projected Stein variational gradient descent ☆12 · Oct 2, 2021 · Updated 4 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning. ☆20 · Jan 19, 2023 · Updated 3 years ago
- Sinkhorn Barycenters via Frank-Wolfe algorithm ☆26 · Feb 3, 2020 · Updated 6 years ago
- Optimal Control Tutorials ☆15 · Aug 20, 2020 · Updated 5 years ago
- Implementation of the POIS algorithm ☆15 · Apr 9, 2019 · Updated 7 years ago
- PyTorch implementation of DARLA preprocessing models ☆11 · Jan 30, 2018 · Updated 8 years ago
- Practical tools for quantifying how well a sample approximates a target distribution ☆28 · Aug 5, 2020 · Updated 5 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo… ☆12 · Jun 20, 2018 · Updated 7 years ago
- Simplistic PyTorch Implementation of the Dreamer-RL ☆20 · May 7, 2025 · Updated 11 months ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future ☆50 · Jun 6, 2019 · Updated 6 years ago
- ☆12 · Apr 19, 2024 · Updated last year
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo… ☆12 · Feb 23, 2025 · Updated last year
- Ant Gather and Ant Maze envs, separated from RLLab ☆11 · Aug 2, 2018 · Updated 7 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks" ☆348 · Nov 22, 2018 · Updated 7 years ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models ☆15 · Jul 9, 2020 · Updated 5 years ago
- ☆10 · May 13, 2025 · Updated 11 months ago
- Implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies ☆13 · Mar 9, 2021 · Updated 5 years ago
- Researching the forward-backward algorithm ☆11 · Aug 3, 2018 · Updated 7 years ago
- PyTorch implementation of Stein Variational Gradient Descent ☆48 · Jun 16, 2023 · Updated 2 years ago
- ☆11 · Jan 22, 2015 · Updated 11 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning ☆93 · Apr 17, 2018 · Updated 7 years ago
- Safe Reinforcement Learning with Natural Language Constraints ☆15 · Oct 24, 2021 · Updated 4 years ago
- ☆28 · Oct 26, 2020 · Updated 5 years ago
- NTK reading group ☆85 · Nov 14, 2019 · Updated 6 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction" ☆166 · Jun 6, 2017 · Updated 8 years ago
- My custom LaTeX classes and styles. ☆14 · Dec 26, 2016 · Updated 9 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms ☆11 · Jan 14, 2025 · Updated last year
- HANA XS Advanced Python Buildpack and example multi-target-application (This Repository has been archived upon Members choice) ☆10 · Feb 19, 2020 · Updated 6 years ago