Riashat / Policy-Gradient-Reinforcement-Learning
☆37 · Updated 9 years ago
Alternatives and similar repositories for Policy-Gradient-Reinforcement-Learning
Users who are interested in Policy-Gradient-Reinforcement-Learning compare it to the repositories listed below
- TD-Regularized Actor-Critic Methods ☆36 · Updated 5 years ago
- Deep Gaussian Process for Inverse Reinforcement Learning ☆33 · Updated 8 years ago
- Using the PILCO algorithm to find a controller for a few robotic problems ☆43 · Updated 10 years ago
- Solutions to the examples and exercises ☆42 · Updated 9 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆48 · Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019. ☆55 · Updated 6 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning ☆95 · Updated 7 years ago
- Hierarchical deep reinforcement learning algorithms ☆41 · Updated 7 years ago
- ☆72 · Updated 6 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr… ☆37 · Updated 6 years ago
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation ☆87 · Updated 7 years ago
- Safe Reinforcement Learning algorithms ☆74 · Updated 3 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari ☆83 · Updated 7 years ago
- ☆77 · Updated 7 years ago
- Notes and comments about Deep Reinforcement Learning papers ☆77 · Updated 7 years ago
- A Multi-agent Learning Framework ☆62 · Updated 4 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards ☆28 · Updated 6 years ago
- Implementation of Deepmind's Neural Episodic Control ☆58 · Updated 7 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO ☆31 · Updated 7 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆45 · Updated 5 years ago
- PILCO policy search framework (Matlab version) ☆73 · Updated 7 years ago
- ☆54 · Updated 7 years ago
- Yet another prioritized experience replay buffer implementation. ☆48 · Updated 2 years ago
- Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142) ☆25 · Updated 3 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆67 · Updated 5 years ago
- Safe exploration in Markov Decision Processes ☆37 · Updated 7 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient) ☆55 · Updated 2 years ago
- Implementation is mostly based on Sergey Levine's work (http://www.eecs.berkeley.edu/~svlevine/). ☆43 · Updated 10 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning ☆32 · Updated 7 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge ☆96 · Updated 3 years ago