georgeyiasemis / Mirror-Descent-and-Interacting-Mirror-DescentLinks

☆8

Alternatives and similar repositories for Mirror-Descent-and-Interacting-Mirror-Descent

Users that are interested in Mirror-Descent-and-Interacting-Mirror-Descent are comparing it to the libraries listed below

Sorting:

DiffEqML / tutorials
☆10Updated 3 years ago
cambridge-mlg / neural_diffusion_processes
☆13Updated 2 years ago
philipjball / ReadyPolicyOne
🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)
☆18Updated 2 years ago
joelouismarino / variational_rl
Variational Reinforcement Learning
☆16Updated 11 months ago
PeideHuang / gradient
Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation.
☆11Updated last year
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆20Updated 2 years ago
lmzintgraf / hyperx
☆16Updated 2 years ago
nathanwispinski / meta-rl
A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.
☆17Updated 2 years ago
ermongroup / f-dre
Featurized Density Ratio Estimation
☆20Updated 4 years ago
Jiacheng-Zhu-AIML / WGPOT
The Wasserstein Distance and Optimal Transport Map of Gaussian Processes
☆52Updated 4 years ago
jongharyu / neural-svd
☆17Updated 10 months ago
facebookresearch / gwil
Cross-Domain Imitation Learning via Optimal Transport
☆25Updated 3 years ago
zdhNarsil / Stochastic-Marginal-Actor-Critic
Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".
☆24Updated 2 years ago
dimarkov / bmr4pml
Bayesian model reduction for probabilistic machine learning
☆11Updated 2 weeks ago
Kaixhin / GUDRL
Generalised UDRL
☆37Updated 3 years ago
dido1998 / CausalMBRL
Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
☆48Updated 4 years ago
akekic / causal-component-analysis
☆25Updated last year
mknbv / neuralode-rl
Neural Ordinary Differential Equations for Reinforcement Learning
☆24Updated 2 years ago
boschresearch / PR-SSM
Python implementation of the PR-SSM.
☆51Updated 7 years ago
ChengzijunAixiaoli / PPMM
Python3 implementation of the paper [Large-scale optimal transport map estimation using projection pursuit]
☆15Updated 4 years ago
sungyubkim / amortized_svgd
A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN
☆19Updated 6 years ago
cagatayyildiz / oderl
Experiment code for "Continuous-Time Model-Based Reinforcement Learning"
☆54Updated last year
spbu-math-cs / Riemannian-Gaussian-Processes
Supplementary code for the NeurIPS 2020 paper "Matern Gaussian processes on Riemannian manifolds".
☆29Updated 5 months ago
jannerm / gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
☆44Updated last year
LaurenceA / bayesfunc
☆15Updated 2 years ago
uncharted-technologies / risk-and-uncertainty
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆30Updated 2 years ago
samholt / NeuralLaplaceControl
Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner…
☆12Updated 2 years ago
gcucurull / maml_flax
Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.
☆19Updated 4 years ago
ben-eysenbach / mnm
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆20Updated 3 years ago
sheqi / GP-RNN_UAI2019
Implementaion of Gaussian Process Recurrent Neural Networks developed in "Neural Dynamics Discovery via Gaussian Process Recurrent Neura…
☆40Updated 2 years ago