WilsonWangTHU / mbbl
☆389 Updated 5 years ago
Alternatives and similar repositories for mbbl:
Users interested in mbbl are comparing it to the libraries listed below:
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆441 Updated last year
- ☆268 Updated 6 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆188 Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆482 Updated 2 years ago
- Multitask Environments for RL ☆275 Updated 3 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL) ☆483 Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆258 Updated 4 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆435 Updated 6 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me… ☆234 Updated 2 years ago
- ☆191 Updated last year
- Reinforcement learning algorithms for MuJoCo tasks ☆383 Updated 8 months ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆370 Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC) ☆526 Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆210 Updated 9 months ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆158 Updated 4 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆417 Updated last year
- Real-World RL Benchmark Suite ☆352 Updated 4 years ago
- Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning ☆210 Updated 2 years ago
- ☆337 Updated 2 years ago
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment ☆175 Updated 7 years ago
- Deep Variational Reinforcement Learning ☆134 Updated 2 years ago
- ☆339 Updated 7 years ago
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit… ☆189 Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE) ☆238 Updated 4 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆607 Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆304 Updated 3 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning ☆578 Updated 4 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model ☆150 Updated 4 years ago
- Code for conservative Q-learning ☆424 Updated 3 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO) ☆370 Updated 5 years ago