Stein Variational Policy Gradient for REINFORCE
☆18Jul 12, 2017Updated 8 years ago
Alternatives and similar repositories for svpg_REINFORCE
Users that are interested in svpg_REINFORCE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Apr 2, 2018Updated 8 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆18Dec 13, 2018Updated 7 years ago
- Experiments of amortized stein variational gradient☆17Apr 30, 2017Updated 9 years ago
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆13Jun 23, 2017Updated 8 years ago
- Implementation of Stein Variational Gradient Descent with TensorFlow 2.0☆12Sep 11, 2019Updated 6 years ago
- Learning structural motif representations for efficient protein structure search☆20May 2, 2017Updated 9 years ago
- MobaXterm注册机☆12Jan 3, 2024Updated 2 years ago
- code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"☆422Mar 21, 2024Updated 2 years ago
- 游戏AI探索者☆16Jul 13, 2018Updated 7 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- projected Stein variational gradient descent☆12Oct 2, 2021Updated 4 years ago
- Notes and scripts for SC2LE released by DeepMind and Blizzard, more details [here](https://github.com/deepmind/pysc2).☆34Feb 1, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- Sinkhorn Barycenters via Frank-Wolfe algorithm☆26Feb 3, 2020Updated 6 years ago
- PyTorch implementation of DARLA preprocessing models☆11Jan 30, 2018Updated 8 years ago
- Implementation of the POIS algorithm☆15Apr 9, 2019Updated 7 years ago
- JAX tutorials for PyTorch users☆14Feb 18, 2023Updated 3 years ago
- Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoo…☆12Jun 20, 2018Updated 7 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated last year
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Jun 6, 2019Updated 7 years ago
- ☆12Apr 19, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ResearchDoom fork of the Chocolate Doom engine.☆16Oct 20, 2017Updated 8 years ago
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…☆12Feb 23, 2025Updated last year
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"☆348Nov 22, 2018Updated 7 years ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models☆15Jul 9, 2020Updated 5 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 5 years ago
- Researching the forward-backward algorithm☆11Aug 3, 2018Updated 7 years ago
- ☆11Nov 13, 2020Updated 5 years ago
- implementing Weight Agnostic Neural Networks to Spiking Neural Networks☆10Jan 26, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementation of Stein Variational Gradient Descent☆49Jun 16, 2023Updated 2 years ago
- ☆11Jan 22, 2015Updated 11 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 6 years ago
- tensorflow Implementation of https://github.com/facebookresearch/MIXER☆11Mar 8, 2017Updated 9 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆93Apr 17, 2018Updated 8 years ago
- ☆29Oct 26, 2020Updated 5 years ago
- NTK reading group☆85Nov 14, 2019Updated 6 years ago