Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Sep 13, 2019Updated 6 years ago
Alternatives and similar repositories for slbo
Users that are interested in slbo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆99Mar 24, 2023Updated 3 years ago
- Official Pytorch Implementation of CMLO in the paper ”When to Update Your Model: Constrained Model-based Reinforcement Learning“☆10Nov 2, 2023Updated 2 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆399Jul 18, 2019Updated 6 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆537Nov 22, 2022Updated 3 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆87Jan 31, 2020Updated 6 years ago
- ☆13Mar 16, 2023Updated 3 years ago
- ☆62Jun 22, 2018Updated 7 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆472Jul 6, 2023Updated 2 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆660Apr 6, 2021Updated 4 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Implementation of Tsallis Actor Critic method☆61Nov 24, 2025Updated 4 months ago
- Reinforcement learning algorithms for MuJoCo tasks☆451Mar 13, 2025Updated last year
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)☆54Jul 7, 2021Updated 4 years ago
- An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018☆15Oct 28, 2018Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆207Nov 22, 2018Updated 7 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- Model-based Reinforcement Learning Framework☆115May 22, 2020Updated 5 years ago
- ☆26Mar 16, 2023Updated 3 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- ☆16Jun 30, 2019Updated 6 years ago
- Implementation of ICML2020 paper <Bidirectional Model-based Policy Optimization>☆23Mar 24, 2023Updated 3 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Jun 6, 2019Updated 6 years ago
- Robust policy search algorithms which train on model ensembles☆30Oct 26, 2016Updated 9 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆21Oct 6, 2021Updated 4 years ago