chloechsu / revisiting-ppoLinks
☆47Updated 4 years ago
Alternatives and similar repositories for revisiting-ppo
Users that are interested in revisiting-ppo are comparing it to the libraries listed below
Sorting:
- Efficient Exploration via State Marginal Matching (2019)☆69Updated 6 years ago
- ☆99Updated 2 years ago
- Revisiting Rainbow☆75Updated 4 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆150Updated 4 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆87Updated 5 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆152Updated 4 years ago
- rllab's viskit with some added features☆73Updated 2 years ago
- A library of probabilistic model based RL algorithms in pytorch☆107Updated 4 years ago
- ☆114Updated 2 years ago
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆79Updated 6 years ago
- Hindsight policy gradients☆45Updated 5 years ago
- ☆112Updated 5 years ago
- Deep Variational Reinforcement Learning☆135Updated 3 years ago
- ☆72Updated 6 years ago
- Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)☆98Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.☆39Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆191Updated 2 years ago
- ☆30Updated 2 years ago
- ☆138Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆61Updated 6 years ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆55Updated 2 years ago
- Proximal Policy Option-Critic☆25Updated 6 years ago
- ☆92Updated last year
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit…☆191Updated 3 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆89Updated 4 years ago
- ☆84Updated 4 years ago