BrunoKM / deep-pilco-torchLinks
Deep PILCO PyTorch Implementation
☆15Updated 2 years ago
Alternatives and similar repositories for deep-pilco-torch
Users that are interested in deep-pilco-torch are comparing it to the libraries listed below
Sorting:
- Formulating Model-based RL Dynamics as a continuous rather then one step prediction☆36Updated 3 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551 ) with PyTorch☆47Updated 5 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 5 months ago
- ☆47Updated last month
- Simple maze environments using mujoco-py☆57Updated 2 years ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆51Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆66Updated 2 years ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆45Updated 2 years ago
- A curated list of awesome Model-based reinforcement learning resources☆95Updated 5 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆82Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Updated 2 years ago
- ☆40Updated 4 years ago
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆53Updated 4 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆41Updated 3 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Updated 2 years ago
- ☆34Updated 5 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆49Updated 3 years ago
- ☆32Updated 2 years ago
- Working directory for dynamics learning for experimental robots.☆57Updated 4 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Updated 3 weeks ago
- Advantage weighted Actor Critic for Offline RL☆51Updated 3 years ago
- ☆99Updated 2 years ago
- ☆10Updated 2 years ago
- improved Cross Entropy Method for trajectory optimization☆81Updated 4 years ago
- ☆56Updated 4 years ago
- ☆52Updated 2 years ago
- Gym-like extensions for POMDP☆56Updated 4 years ago
- ☆32Updated 4 years ago
- ☆23Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago