kchua / handful-of-trialsLinks
Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"
☆466Updated 2 years ago
Alternatives and similar repositories for handful-of-trials
Users that are interested in handful-of-trials are comparing it to the libraries listed below
Sorting:
- ☆398Updated 6 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆516Updated 3 years ago
- ☆273Updated 7 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆195Updated 2 years ago
- Bayesian Reinforcement Learning in Tensorflow☆334Updated 4 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆372Updated 4 years ago
- Multitask Environments for RL☆280Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆253Updated 5 years ago
- ☆203Updated 2 years ago
- Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning☆217Updated 2 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆504Updated 3 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆153Updated 5 years ago
- Real-World RL Benchmark Suite☆360Updated 5 years ago
- PyTorch implementation of Trust Region Policy Optimization☆450Updated 7 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆318Updated last year
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆266Updated 5 years ago
- Safe reinforcement learning with stability guarantees☆234Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆565Updated 4 years ago
- ☆345Updated 7 years ago
- Reinforcement learning algorithms for MuJoCo tasks☆430Updated 8 months ago
- ☆92Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆571Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆224Updated last year
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆245Updated 3 years ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL☆654Updated last year
- ☆357Updated 3 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Updated 5 years ago
- Constrained Policy Optimization☆334Updated 8 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆338Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆198Updated 2 years ago