Stein Variational Policy Gradient for REINFORCE
☆18Jul 12, 2017Updated 8 years ago
Alternatives and similar repositories for svpg_REINFORCE
Users that are interested in svpg_REINFORCE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Apr 2, 2018Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- Experiments of amortized stein variational gradient☆16Apr 30, 2017Updated 9 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A dataset for realistic evaluation of noisy label methods☆15Dec 3, 2023Updated 2 years ago
- ☆13Jun 23, 2017Updated 8 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0☆12Sep 11, 2019Updated 6 years ago
- Learning structural motif representations for efficient protein structure search☆20May 2, 2017Updated 9 years ago
- Tensorflow implementation of Stein Variational Gradient Descent (SVGD)☆26Jan 13, 2018Updated 8 years ago
- code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"☆424Mar 21, 2024Updated 2 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- projected Stein variational gradient descent☆12Oct 2, 2021Updated 4 years ago
- Notes and scripts for SC2LE released by DeepMind and Blizzard, more details [here](https://github.com/deepmind/pysc2).☆34Feb 1, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- ☆10Jun 23, 2018Updated 7 years ago
- Optimal Control Tutorials☆15Aug 20, 2020Updated 5 years ago
- PyTorch implementation of DARLA preprocessing models☆11Jan 30, 2018Updated 8 years ago
- Implementation of the POIS algorithm☆15Apr 9, 2019Updated 7 years ago
- Practical tools for quantifying how well a sample approximates a target distribution☆28Aug 5, 2020Updated 5 years ago
- JAX tutorials for PyTorch users☆14Feb 18, 2023Updated 3 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated last year
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Jun 6, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Apr 19, 2024Updated 2 years ago
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…☆12Feb 23, 2025Updated last year
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆347Nov 22, 2018Updated 7 years ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models☆15Jul 9, 2020Updated 5 years ago
- ☆10May 13, 2025Updated last year
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 5 years ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- implementing Weight Agnostic Neural Networks to Spiking Neural Networks☆10Jan 26, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Distributed Tensorflow Implementation of Asynchronous DDPG☆12Oct 25, 2017Updated 8 years ago
- A Pytorch implementation of the KWNG estimator☆14Jul 25, 2024Updated last year
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 6 years ago
- ☆28Oct 26, 2020Updated 5 years ago
- The source code for "An Actor Critic Algorithm for Structured Prediction"☆166Jun 6, 2017Updated 8 years ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- My custom LaTeX classes and styles.☆14Dec 26, 2016Updated 9 years ago