Santara/stochastic_value_gradient

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Santara/stochastic_value_gradient)

Santara / stochastic_value_gradient

Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]

☆25

Alternatives and similar repositories for stochastic_value_gradient

Users that are interested in stochastic_value_gradient are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / svg
View on GitHub
On the model-based stochastic value gradient for continuous reinforcement learning
☆58Mar 6, 2026Updated 4 months ago
siemens / policy_search_bb-alpha
View on GitHub
☆69May 26, 2018Updated 8 years ago
anirudh9119 / rl_adversarial
View on GitHub
Learning Backtracking Models, ICLR'19
☆10Feb 2, 2018Updated 8 years ago
nnaisense / MAGE
View on GitHub
Learning Action-Value Gradients in Model-based Policy Optimization
☆32Sep 7, 2021Updated 4 years ago
RobustStabilityGuaranteeRL / RobustStabilityGuaranteeRL
View on GitHub
RobustStabilityGuaranteeRL
☆10Aug 22, 2019Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
siemens / industrialbenchmark
View on GitHub
Industrial Benchmark
☆147Mar 5, 2026Updated 4 months ago
StepNeverStop / RLwithUnity
View on GitHub
Reinforcement Leanring Algorithms Trained with Unity
☆13Apr 26, 2019Updated 7 years ago
zzyunzhi / asynch-mb
View on GitHub
(CoRL 2019 Spotlight) Asynchronous Methods for Model-Based Reinforcement Learning
☆14Dec 27, 2022Updated 3 years ago
chishaxie / oss-sync
View on GitHub
现在好用的能同步的网盘都没有了，于是自己用阿里云的OSS撸了一个
☆10Apr 22, 2018Updated 8 years ago
wyndwarrior / Sectar
View on GitHub
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
☆96Jun 8, 2018Updated 8 years ago
manxing-du / cmdp-rtb
View on GitHub
☆10Apr 18, 2017Updated 9 years ago
Ericcsr / DiffSRL
View on GitHub
Official Project Webpage for paper "DiffSRL: Learning Dynamic-aware State Representation for Control via Differentiable Simulation"
☆12Apr 4, 2022Updated 4 years ago
willwhitney / dynamics-aware-embeddings
View on GitHub
Official implementation of DynE, Dynamics-aware Embeddings for RL
☆45Apr 28, 2021Updated 5 years ago
RockySJ / ampo
View on GitHub
☆15Oct 20, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tpbarron / gym-baxter
View on GitHub
Open AI gym environment for the Baxter robot
☆14Oct 6, 2016Updated 9 years ago
facebookresearch / slbo
View on GitHub
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆94Sep 13, 2019Updated 6 years ago
AutumnWu / Streamlined-Off-Policy-Learning
View on GitHub
ICRL 2020
☆20Feb 18, 2020Updated 6 years ago
gkahn13 / gcg-old
View on GitHub
a library for deep reinforcement learning, with applications for navigation
☆16Feb 6, 2018Updated 8 years ago
paulovbpo / reinforcement_learning_control_nnets_model
View on GitHub
My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.
☆12Apr 28, 2020Updated 6 years ago
rogeredc / LSTM_MPC-reactive-distillation-system
View on GitHub
LSTM_MPC(reactive distillation system)
☆11Jul 21, 2020Updated 6 years ago
AlgTUDelft / ConstrainedPlanningToolbox
View on GitHub
An open-source toolbox for constrained multi-agent planning under uncertainty.
☆12Dec 12, 2018Updated 7 years ago
tomsilver / policies_logic_programs
View on GitHub
Few-shot Bayesian Imitation Learning with Policies as Logic over Programs
☆21Oct 19, 2025Updated 9 months ago
kchua / handful-of-trials
View on GitHub
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆475Jul 6, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
roosephu / slbo
View on GitHub
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55Jul 26, 2019Updated 6 years ago
rlseminar / rlseminar.github.io
View on GitHub
Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.
☆21Nov 17, 2023Updated 2 years ago
ucl-cssb / ROCC
View on GitHub
ROCC: Reinforcement learning for the Optimisation of Co-Cultures
☆13Nov 17, 2020Updated 5 years ago
kpot / kerl
View on GitHub
KERL: reinforcement learning algorithms and tools implemented using Keras
☆11Aug 2, 2024Updated last year
fmeier / local_gaussian_regression
View on GitHub
☆11Feb 6, 2018Updated 8 years ago
neitzal / adaptive-skip-intervals
View on GitHub
Implementation of the paper "Adaptive Skip Intervals: Temporal Abstraction for Recurrent Dynamical Models"
☆24Sep 7, 2018Updated 7 years ago
xuanlinli17 / iclr2021_rlreg
View on GitHub
Regularization Matters in Policy Optimization
☆21Nov 1, 2021Updated 4 years ago
haarnoja / softqlearning
View on GitHub
Reinforcement Learning with Deep Energy-Based Policies
☆438Nov 28, 2023Updated 2 years ago
rudolfsteiner / MPC
View on GitHub
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning
☆29Jun 12, 2018Updated 8 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
WilsonWangTHU / POPLIN
View on GitHub
☆99Mar 24, 2023Updated 3 years ago
sungyubkim / amortized_svgd
View on GitHub
A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN
☆18Dec 13, 2018Updated 7 years ago
franrruiz / uivi
View on GitHub
Code for Unbiased Implicit Variational Inference (UIVI)
☆15Jan 18, 2019Updated 7 years ago
tpbarron / pytorch-a2c
View on GitHub
Simple change of a3c to a2c
☆15Jun 18, 2017Updated 9 years ago
gt-ros-pkg / hrl-assistive
View on GitHub
PR2 controllers and interfaces for assistive teleoperation
☆23Jun 9, 2019Updated 7 years ago
kinwo / deeprl-continuous-control
View on GitHub
Learning Continuous Control in Deep Reinforcement Learning
☆14Nov 24, 2018Updated 7 years ago
KordingLab / hilbert-constrained-gradient-descent
View on GitHub
Pytorch optimizers implementing Hilbert Constrained Gradient Descent
☆19May 9, 2019Updated 7 years ago