cbfinn / gps
Guided Policy Search
☆598 · Updated 4 years ago
Alternatives and similar repositories for gps:
Users who are interested in gps are comparing it to the repositories listed below.
- ☆339 · Updated 7 years ago
- Reinforcement Learning with Deep Energy-Based Policies ☆417 · Updated last year
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir… ☆273 · Updated 6 years ago
- Implementation of TRPO and related algorithms ☆625 · Updated 6 years ago
- Code for the paper "Generative Adversarial Imitation Learning" ☆700 · Updated 6 years ago
- TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016) ☆212 · Updated 7 years ago
- S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics ☆625 · Updated 3 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym ☆361 · Updated 4 years ago
- "Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow ☆192 · Updated 6 years ago
- ☆389 · Updated 5 years ago
- [NIPS 2017] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations ☆177 · Updated 3 months ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆482 · Updated 2 years ago
- Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks" ☆341 · Updated 6 years ago
- Code for hierarchical imitation learning and reinforcement learning ☆288 · Updated 6 years ago
- Implementation of algorithms for continuous control (DDPG and NAF). ☆307 · Updated 4 years ago
- TensorFlow implementation of generative adversarial imitation learning ☆199 · Updated 6 years ago
- Value Iteration Networks ☆289 · Updated 7 years ago
- Reimplementation of DDPG (Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + TensorFlow ☆559 · Updated 3 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆441 · Updated last year
- ☆268 · Updated 6 years ago
- PyTorch implementation of Trust Region Policy Optimization ☆435 · Updated 6 years ago
- Constrained Policy Optimization ☆310 · Updated 7 years ago
- Implementation of Inverse Reinforcement Learning (IRL) algorithms in Python/TensorFlow. Deep MaxEnt, MaxEnt, LPIRL ☆615 · Updated 9 months ago
- ☆159 · Updated 7 years ago
- Code for the paper "Meta-Learning Shared Hierarchies" ☆611 · Updated last year
- Refer to https://github.com/AcutronicRobotics/gym-gazebo2 for the new version ☆834 · Updated 5 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning. ☆658 · Updated 4 years ago
- Implementation of Meta-RL A3C algorithm ☆402 · Updated 7 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp… ☆1,268 · Updated last year
- ICML 2018 Self-Imitation Learning ☆275 · Updated 4 years ago