WilsonWangTHU / mbbl
☆389Updated 5 years ago
Alternatives and similar repositories for mbbl:
Users that are interested in mbbl are comparing it to the libraries listed below
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆437Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆479Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆187Updated 2 years ago
- ☆266Updated 6 years ago
- Multitask Environments for RL☆274Updated 3 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆369Updated 3 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆481Updated 2 years ago
- Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning☆208Updated 2 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆234Updated 2 years ago
- Reinforcement learning algorithms for MuJoCo tasks☆374Updated 7 months ago
- ☆191Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆210Updated 7 months ago
- Reinforcement Learning with Deep Energy-Based Policies☆418Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆523Updated 3 years ago
- PyTorch implementation of Trust Region Policy Optimization☆436Updated 6 years ago
- ☆335Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆158Updated 4 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆339Updated 3 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆603Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆237Updated 4 years ago
- ☆91Updated last year
- Code for conservative Q-learning☆420Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆257Updated 4 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆186Updated last year
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL☆607Updated 8 months ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆291Updated last year
- Bayesian Reinforcement Learning in Tensorflow☆316Updated 3 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆150Updated 4 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆303Updated 3 years ago
- Constrained Policy Optimization☆308Updated 7 years ago