WilsonWangTHU / mbbl
☆391 · Updated 5 years ago
Alternatives and similar repositories for mbbl
Users who are interested in mbbl are comparing it to the libraries listed below.
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆446 · Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆489 · Updated 2 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆189 · Updated 2 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL) ☆488 · Updated 2 years ago
- Multitask Environments for RL ☆276 · Updated 3 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics ☆370 · Updated 3 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me… ☆239 · Updated 2 years ago
- ☆195 · Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆259 · Updated 4 years ago
- ☆272 · Updated 6 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆423 · Updated last year
- PyTorch implementation of Trust Region Policy Optimization ☆440 · Updated 6 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning ☆587 · Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic (SAC) ☆542 · Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆215 · Updated 11 months ago
- Reinforcement learning algorithms for MuJoCo tasks ☆404 · Updated 2 months ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆166 · Updated 3 months ago
- Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning ☆212 · Updated 2 years ago
- ☆343 · Updated 2 years ago
- Code for conservative Q-learning ☆438 · Updated 3 years ago
- DrQ: Data regularized Q ☆415 · Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch. ☆302 · Updated last year
- ☆91 · Updated last year
- Implementation of the Option-Critic Architecture on the Atari (ALE) environment ☆177 · Updated 7 years ago
- Safe reinforcement learning with stability guarantees ☆232 · Updated 3 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆200 · Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE) ☆239 · Updated 5 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆359 · Updated 3 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆160 · Updated 4 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆628 · Updated 4 years ago