quanvuong / handful-of-trials-pytorch
Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆186Updated last year
Related projects ⓘ
Alternatives and complementary repositories for handful-of-trials-pytorch
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆156Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆171Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆205Updated 6 months ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆163Updated 2 years ago
- ☆190Updated last year
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆431Updated last year
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆226Updated 4 years ago
- Reinforcement learning algorithms for MuJoCo tasks☆365Updated 5 months ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆184Updated last year
- ☆97Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆286Updated 10 months ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- ☆389Updated 5 years ago
- ☆110Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆479Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 3 months ago
- ☆90Updated 11 months ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆232Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆159Updated 4 years ago
- ☆332Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆152Updated last week
- Multitask Environments for RL☆274Updated 3 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆365Updated 3 years ago
- ☆106Updated 5 years ago
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit…☆186Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆252Updated last year
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆149Updated 4 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆328Updated 2 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆307Updated 3 months ago