georgeyiasemis / Mirror-Descent-and-Interacting-Mirror-DescentLinks
☆8Updated 4 years ago
Alternatives and similar repositories for Mirror-Descent-and-Interacting-Mirror-Descent
Users that are interested in Mirror-Descent-and-Interacting-Mirror-Descent are comparing it to the libraries listed below
Sorting:
- ☆10Updated 3 years ago
- ☆13Updated 2 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated 2 years ago
- Variational Reinforcement Learning☆16Updated 11 months ago
- Code base for NeurIPS 2022 paper Curriculum Reinforcement Learning using Optimal Transport via Gradual Domain Adaptation.☆11Updated last year
- Implementation of Proximal Policy Optimization in Jax+Flax☆20Updated 2 years ago
- ☆16Updated 2 years ago
- A short conceptual replication of "Prefrontal cortex as a meta-reinforcement learning system" in Jax.☆17Updated 2 years ago
- Featurized Density Ratio Estimation☆20Updated 4 years ago
- The Wasserstein Distance and Optimal Transport Map of Gaussian Processes☆52Updated 4 years ago
- ☆17Updated 10 months ago
- Cross-Domain Imitation Learning via Optimal Transport☆25Updated 3 years ago
- Official pytorch implementation for our ICLR 2023 paper "Latent State Marginalization as a Low-cost Approach for Improving Exploration".☆24Updated 2 years ago
- Bayesian model reduction for probabilistic machine learning☆11Updated 2 weeks ago
- Generalised UDRL☆37Updated 3 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆48Updated 4 years ago
- ☆25Updated last year
- Neural Ordinary Differential Equations for Reinforcement Learning☆24Updated 2 years ago
- Python implementation of the PR-SSM.☆51Updated 7 years ago
- Python3 implementation of the paper [Large-scale optimal transport map estimation using projection pursuit]☆15Updated 4 years ago
- A pytorch implementation of Amortized Stein Variational Gradient Descent/ Stein GAN☆19Updated 6 years ago
- Experiment code for "Continuous-Time Model-Based Reinforcement Learning"☆54Updated last year
- Supplementary code for the NeurIPS 2020 paper "Matern Gaussian processes on Riemannian manifolds".☆29Updated 5 months ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- ☆15Updated 2 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆30Updated 2 years ago
- Neural Laplace Control for Continuous-time Delayed Systems - an offline RL method combining Neural Laplace dynamics model and MPC planner…☆12Updated 2 years ago
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆19Updated 4 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 3 years ago
- Implementaion of Gaussian Process Recurrent Neural Networks developed in "Neural Dynamics Discovery via Gaussian Process Recurrent Neura…☆40Updated 2 years ago