Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆93Sep 13, 2019Updated 6 years ago
Alternatives and similar repositories for slbo
Users that are interested in slbo are comparing it to the libraries listed below
Sorting:
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆99Mar 24, 2023Updated 2 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆81Jul 23, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- ☆92Dec 5, 2023Updated 2 years ago
- ☆398Jul 18, 2019Updated 6 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆87Jan 31, 2020Updated 6 years ago
- ☆13Mar 16, 2023Updated 2 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- ☆62Jun 22, 2018Updated 7 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆532Nov 22, 2022Updated 3 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆206Nov 22, 2018Updated 7 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Sep 7, 2018Updated 7 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆473Jul 6, 2023Updated 2 years ago
- Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration☆25Sep 9, 2019Updated 6 years ago
- Implementation of Tsallis Actor Critic method☆61Nov 24, 2025Updated 3 months ago
- Official Pytorch Implementation of CMLO in the paper ”When to Update Your Model: Constrained Model-based Reinforcement Learning“☆10Nov 2, 2023Updated 2 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Oct 26, 2018Updated 7 years ago
- This repo replicates the results Horgan et al obtained in "Distributed Prioritized Experience Replay"☆190Mar 18, 2019Updated 6 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆657Apr 6, 2021Updated 4 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆65Nov 8, 2019Updated 6 years ago
- Model-based Reinforcement Learning Framework☆115May 22, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆117Dec 13, 2019Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆278Apr 18, 2020Updated 5 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- An Empirical Analysis of Gradient Descent Optimization in Policy Gradient Methods - EWRL Workshop 2018☆15Oct 28, 2018Updated 7 years ago
- Easing non-convex optimization with neural networks.☆23Aug 21, 2018Updated 7 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆96Jul 27, 2022Updated 3 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆44Jun 14, 2021Updated 4 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago