yashchandak / OptFuture_NSMDPLinks

Optimizing for the Future in Non-Stationary MDPs

☆9

Alternatives and similar repositories for OptFuture_NSMDP

Users that are interested in OptFuture_NSMDP are comparing it to the libraries listed below

Sorting:

causal-rl-anonymous / causal-rl
☆44Updated 3 years ago
dtak / mbrl-smdp-ode
PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020
☆42Updated 4 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
boschresearch / DD_OPG
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Updated 6 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
RockySJ / ampo
☆15Updated 4 years ago
XanderJC / scalable-birl
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
☆47Updated 4 years ago
albertometelli / wql
☆9Updated 5 years ago
rlai-lab / Regularized-GradientTD
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆36Updated 4 years ago
llan-ml / tesp
Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"
☆34Updated 6 years ago
abbyvansoest / maxent
☆13Updated 6 years ago
Maluuba / srw
Dead-ends and Secure Exploration in Reinforcement Learning
☆11Updated 6 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48Updated 6 years ago
DavidJanz / successor_uncertainties_atari
Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…
☆21Updated 2 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Updated 5 years ago
hsvgbkhgbv / Thermostat-assisted-continuously-tempered-Hamiltonian-Monte-Carlo-for-Bayesian-learning
Thermostat-assisted continuously-tempered Hamiltonian Monte Carlo for Bayesian learning
☆10Updated 6 years ago
qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆94Updated 3 weeks ago
aa14k / Exploration-in-RL
☆28Updated last year
luchris429 / model-free-opponent-shaping
Code for Model-Free Opponent Shaping (ICML 2022)
☆19Updated 2 years ago
wendelinboehmer / dcg
☆76Updated last year
nnaisense / MAGE
Learning Action-Value Gradients in Model-based Policy Optimization
☆31Updated 3 years ago
Adaptive-RL / AdaRL-code
Implementation codes and datasets used in ICLR'22 Spotlight paper AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning…
☆38Updated last year
geyang / e-maml
E-MAML, and RL-MAML baseline implemented in Tensorflow v1
☆16Updated 5 years ago
sebascuri / hucrl
☆30Updated last year
joeybose / FloRL
Implicit Normalizing Flows + Reinforcement Learning
☆61Updated 6 years ago
younggyoseo / lasertag-v0
Implementation of Deepmind's LaserTag-v0 game in A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning(2017)
☆19Updated 6 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆56Updated 6 years ago
tedmoskovitz / WNPG
implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies
☆11Updated 4 years ago
asonabend / ESRL
Code for Expert Supervised Reinforcement Learning
☆10Updated 4 years ago
radar-research-lab / MFGLib
A library for mean-field games.
☆53Updated 2 weeks ago