DartML/PPO-Stein-Control-Variate

Proximal Policy Optimization with Stein Control Variates

☆33

Alternatives and similar repositories for PPO-Stein-Control-Variate

Users who are interested in PPO-Stein-Control-Variate are comparing it to the libraries listed below.

largelymfs / svpg_REINFORCE
Stein Variational Policy Gradient for REINFORCE
☆18 · Jul 12, 2017 · Updated 9 years ago
rlbayes / rllabplusplus
☆162 · Jul 21, 2017 · Updated 9 years ago
YingzhenLi / SteinGrad
Code release for the ICLR paper
☆22 · Jun 13, 2018 · Updated 8 years ago
Breakend / RLSSContinuousControlTutorial
Tutorial on continuous control at Reinforcement Learning Summer School 2017.
☆34 · Jul 3, 2017 · Updated 9 years ago
singhalrk / stein_ksd
☆10 · Apr 2, 2018 · Updated 8 years ago
jsikyoon / bmaml_rl
This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.
☆20 · Jan 19, 2023 · Updated 3 years ago
younggyoseo / pytorch-acer
PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay (ACER)
☆16 · Oct 7, 2020 · Updated 5 years ago
jachiam / surprise
Surprise-based intrinsic motivation for deep reinforcement learning
☆21 · Mar 6, 2017 · Updated 9 years ago
ChunyuanLI / RAS
AISTATS 2019: Reference-based Adversarial Sampling & Its applications to Soft Q-learning
☆15 · Jan 21, 2019 · Updated 7 years ago
Kaixhin / NoisyNet-A3C
Noisy Networks for Exploration
☆187 · Jan 28, 2018 · Updated 8 years ago
ramp-kits / rl_simulator
Model-based reinforcement learning (generative simulator models and planning agents)
☆16 · Mar 13, 2026 · Updated 4 months ago
wangyuhuix / TrulyPPO
☆29 · Nov 21, 2022 · Updated 3 years ago
lewisKit / Amortized_SVGD
Experiments of amortized Stein variational gradient
☆17 · Apr 30, 2017 · Updated 9 years ago
ankitkv / TD-VAE
TD-VAE in PyTorch
☆10 · May 28, 2019 · Updated 7 years ago
hiwonjoon / ICML2019-TREX
☆86 · Apr 10, 2021 · Updated 5 years ago
duckietown-udem / udem-fall19-public
Public accompanying repository for Universite de Montreal's IFT 6757: Autonomous Vehicles, Fall 2019.
☆12 · Jun 21, 2022 · Updated 4 years ago
haarnoja / softqlearning
Reinforcement Learning with Deep Energy-Based Policies
☆438 · Nov 28, 2023 · Updated 2 years ago
openai / EPG
Code for the paper "Evolved Policy Gradients"
☆253 · Nov 22, 2018 · Updated 7 years ago
dilinwang820 / Stein-Variational-Gradient-Descent
Code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"
☆425 · Mar 21, 2024 · Updated 2 years ago
YingzhenLi / vae_renyi_divergence
A TensorFlow implementation of VAE training with Renyi divergence
☆32 · Sep 14, 2016 · Updated 9 years ago
CausalML / DoubleReinforcementLearningMDP
☆14 · May 15, 2025 · Updated last year
activatedgeek / svgd
PyTorch implementation of Stein Variational Gradient Descent
☆49 · Jun 16, 2023 · Updated 3 years ago
robintyh1 / onpolicybaselines
On-policy optimization baselines for deep reinforcement learning
☆32 · Apr 3, 2020 · Updated 6 years ago
wataruhashimoto52 / svgd_tf
Implementation of Stein Variational Gradient Descent with TensorFlow 2.0
☆12 · Sep 11, 2019 · Updated 6 years ago
jsikyoon / bmaml
This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.
☆59 · Jul 12, 2019 · Updated 7 years ago
krfricke / rl-benchmark
Reinforcement learning benchmarking.
☆39 · Oct 22, 2018 · Updated 7 years ago
gkahn13 / gcg-old
A library for deep reinforcement learning, with applications for navigation
☆16 · Feb 6, 2018 · Updated 8 years ago
msobroza / SparsemaxPytorch
SparseMax activation function implementation (ICML 2016) (PyTorch)
☆28 · Nov 30, 2017 · Updated 8 years ago
shareeff / PPO
TensorFlow implementation of proximal policy optimization (PPO) algorithm
☆13 · Feb 28, 2018 · Updated 8 years ago
edbeeching / 3d_control_deep_rl
Baselines and memory-based scenarios for the ViZDoom simulator
☆36 · Dec 8, 2022 · Updated 3 years ago
uidilr / ppo_tf
Implementation of proximal policy optimization (PPO) with TensorFlow
☆35 · Feb 10, 2018 · Updated 8 years ago
tjuHaoXiaotian / GASIL
Independent Generative Adversarial Self-Imitation Learning In Cooperative Multiagent Systems
☆32 · Oct 9, 2018 · Updated 7 years ago
ofirnachum / models
Models built with TensorFlow
☆26 · Dec 5, 2018 · Updated 7 years ago
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆154 · Sep 22, 2017 · Updated 8 years ago
ikrets / CS294-158-homeworks
My homework solutions for UC Berkeley CS294: deep unsupervised learning
☆14 · Mar 24, 2023 · Updated 3 years ago
taodav / nsrs
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14 · Jul 16, 2024 · Updated 2 years ago
ericjang / nf-jax
Normalizing Flows in Jax
☆109 · Aug 19, 2020 · Updated 5 years ago
orybkin / video-gcp
Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"
☆46 · Nov 22, 2022 · Updated 3 years ago
openai / robosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
☆309 · Apr 13, 2023 · Updated 3 years ago